-
Currently _zotero-ocr_ requires additional installation steps for `pdftoppm` and `tesseract`.
Both could be replaced by pure JavaScript implementations which could be included in _zotero-ocr_ to si…
-
### Requested feature
Our users encounter from time to time documents that instead of text have vector path's representing themselves as text.
Because of the vector nature of it, we do not automat…
-
### Requested feature
We need to have a way to add a timeout parameter when processing a document. Currently, it happens in very rare cases that certain documents will take very long to convert. In…
-
I'm using windows and vue.js
`Audiveris stderr: Error opening data file C:\Program Files\tesseract-ocr\tessdata/deu.traineddata
Please make sure the TESSDATA_PREFIX environment variable is set to …
-
What technology would I need?
What coding would this need?
Do I need AI to parse whatever is scanned?
-
This issue continues that part of #16 about OCR, but with other files.
[Two files](https://file.io/6mUrQvhM4xf6). File Kornai. I can correctly copy text from djvu file in DjVu4, but not in Ocular…
-
### Summer/early fall
REVIEW state by October 5
**NEW**
- version date 2024-10-31
- version n 6.0.0
- [ ] OCR'd material in GitDox (requires sentence splitting, auto-NLP); see [OCR google d…
-
I believe it would be helpful if the ocrd-fileformat-transform PAGE → ALTO transformation would add a `` tag. I looked into to the file to figure out if https://github.com/kba/page-to-alto was used fo…
-
are we done with this? see #27
-
### What features would you like to see added?
I would like the ability to upload PDF files and have them automatically transformed into images. These images should then be processed by the integrate…