Searchable PDF OCR in the browser using Tesseract.js

(demopdf.deno.dev)

2 points | by photoncat 7 hours ago

2 comments

  • photoncat 6 hours ago
    FYI- I built a simple tool that lets you OCR PDFs entirely in your browser using Tesseract.js and PDF.js. No server required - all processing happens client-side. this tool creates a text transparent layer that preserves original document layout using bounding box coordinates like most ocr tools.