mozilla / pdf.js

PDF Reader in JavaScript
https://mozilla.github.io/pdf.js/
Apache License 2.0
48.84k stars 10.04k forks source link

[Feature]: OCR Support for Non-Selectable Text in PDF Documents #18949

Closed rossanodr closed 1 month ago

rossanodr commented 1 month ago

Is the feature relevant to the Firefox PDF Viewer?

No

Feature description

I couldn't find a current implementation or any updates on OCR (Optical Character Recognition) support for documents with non-selectable text in pdf.js. I came across some old issues discussing the topic, but there hasn't been a clear follow-up.

Could you provide an update on whether there are plans to include OCR support for scanned documents or image-based PDFs in pdf.js? This feature would be highly valuable for making non-selectable text searchable and accessible.

Thank you!

Other PDF viewers

No response

Snuffleupagus commented 1 month ago

Duplicate of #15843