cyanfish / naps2

Scan documents to PDF and more, as simply as possible.
https://www.naps2.com

OCR segmentation #258

Open raindropsfromsky opened 10 months ago

raindropsfromsky commented 10 months ago

Is your feature request related to a problem? Please describe.

NAPS2 decides the content type of each block automatically: it OCRs "text" and "table" blocks and reproduces "image" blocks without OCR. However, some images contain embedded text, and such images are also treated as text blocks. The result is suboptimal.

Describe the solution you'd like

Let NAPS2 show the default block types to the user, and allow the user to edit those blocks.

All professional OCR software has this feature.

kintaro1981 commented 10 months ago

@raindropsfromsky I reported the same problem in #242. In the meantime, have you found a workaround using any alternatives?

EmperorArthur commented 9 months ago

While not exactly the same issue, this is related to #208.