unidoc / unipdf

Golang PDF library for creating and processing PDF files (pure go)
https://unidoc.io

[FEATURE] Is there a plan to support scanned PDFs? #526

Open geekr-dev opened 9 months ago

github-actions[bot] commented 9 months ago

Welcome! Thanks for posting your first issue. The way things work here is that while customer issues are prioritized, other issues go into our backlog where they are assessed and fitted into the roadmap when suitable. If you need to get this done, consider buying a license which also enables you to use it in your commercial products. More information can be found on https://unidoc.io/

sampila commented 9 months ago

Hi @geekr-dev,

Could you explain what you mean by support for scanned PDFs? Do you mean text extraction from scanned PDF files?

geekr-dev commented 9 months ago

Yes. In addition, some non-standard PDF files produce garbled text during parsing: test1.pdf test2.pdf

sampila commented 9 months ago

Thanks for sharing the PDF files. We will investigate this issue.