FYI: I built a simple tool that lets you OCR PDFs entirely in your browser using Tesseract.js and PDF.js. No server required; all processing happens client-side. Like most OCR tools, it creates a transparent text layer that preserves the original document layout using bounding box coordinates.
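
For anyone curious how the bounding box part can work, here is a minimal sketch of the coordinate mapping, not the tool's actual code. It assumes the page was rasterized at a known scale (e.g. the `scale` passed to PDF.js `getViewport`), that OCR word boxes come back as `{x0, y0, x1, y1}` in pixels with a top-left origin (the shape Tesseract.js uses for `bbox`), and that the page is unrotated with its origin at the bottom-left corner. The names `PixelBox`, `PdfRect`, and `pixelBoxToPdfRect` are hypothetical.

```typescript
// Hypothetical types for illustration; not taken from the tool's source.
interface PixelBox { x0: number; y0: number; x1: number; y1: number; }
interface PdfRect { x: number; y: number; width: number; height: number; }

// Convert an OCR box in rendered-image pixels (origin top-left, y grows down)
// into PDF user-space points (origin bottom-left, y grows up).
// Assumes an unrotated page and that `scale` is pixels per PDF point.
function pixelBoxToPdfRect(
  box: PixelBox,
  scale: number,
  pageHeightPt: number
): PdfRect {
  const x = box.x0 / scale;
  const width = (box.x1 - box.x0) / scale;
  const height = (box.y1 - box.y0) / scale;
  // The bottom edge of the box in image space becomes the rect's y in PDF space.
  const y = pageHeightPt - box.y1 / scale;
  return { x, y, width, height };
}

// Example: a word found at pixels (200,100)-(400,140) on a US Letter page
// (792 pt tall) that was rendered at 2x.
const rect = pixelBoxToPdfRect({ x0: 200, y0: 100, x1: 400, y1: 140 }, 2, 792);
console.log(rect); // { x: 100, y: 722, width: 100, height: 20 }
```

Each recognized word would then be drawn as invisible text inside its rect, which is what makes the output searchable and selectable while the scanned image stays visible on top.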