ch-a-os / DocSort

Digitize and access everything, everywhere.
GNU General Public License v3.0
0 stars 1 forks source link

Feature: Text-Recognition #14

Open ghost opened 5 years ago

ghost commented 5 years ago

Implement a way (lib?) to let uploaded documents be scanned for text (OCR) which is then saved into a string-field for that document. This feature is a little bit more complex, because the ocr-scan can take some time and there need to be mechanisms to recognize a failed or aborted scan and recover from that. Also the user should be able to select "ocr-scan: yes/no" on each upload, also he should be able to set a prefered default-state of that toggle-switch in his userprofile-settings.

ghost commented 5 years ago

I changed the title to "FEATURE: 'Blocked by predecessors' field in work packages" because, as discussed, there are more ways to scan a document than OCR. A pdf-file, for example, may have text-areas embedded in addition to image-content. We can/should scan for both, which get's a little bit more complex when combining both but it should work.