-
### Current Behavior
I used Tesseract 5.4.1 in WSL/Win10 and Tesseract 5.0.1 in gImageReader/Win10 with different image files (Fraktur newspaper and Latin/LibreOffice document, 2 columns, all images…
-
For reproduction steps see the workaround for https://github.com/OurDigitalWorld/hocrmod/issues/1
At the top right is the text 'print', which this script still doesn't find.
```
python hocrmod.py…
-
[hOCR](https://www.wikiwand.com/en/HOCR) is an open standard of data representation for formatted text obtained from optical character recognition (OCR). The definition encodes text, style, layout inf…
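
As a rough illustration of how hOCR carries layout information alongside text, the sketch below pulls word text and bounding boxes out of a small hOCR fragment using only the Python standard library. The `SAMPLE` markup is made up for this example, and `HocrWordParser` and `parse_words` are hypothetical helper names; the sketch assumes words are marked with the `ocrx_word` class and carry a `bbox x0 y0 x1 y1` entry in their `title` attribute, as Tesseract's hOCR output does.

```python
import re
from html.parser import HTMLParser

# Matches the "bbox x0 y0 x1 y1" property inside an hOCR title attribute.
BBOX_RE = re.compile(r"bbox (\d+) (\d+) (\d+) (\d+)")


class HocrWordParser(HTMLParser):
    """Collect (text, bbox) pairs for elements with class 'ocrx_word'."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._bbox = None
        self._chunks = []
        self._depth = 0  # nesting depth inside the current word element

    def handle_starttag(self, tag, attrs):
        if self._depth:
            # Nested markup inside a word (e.g. <strong>): just track depth.
            self._depth += 1
            return
        attr = dict(attrs)
        classes = (attr.get("class") or "").split()
        if "ocrx_word" in classes:
            match = BBOX_RE.search(attr.get("title") or "")
            self._bbox = tuple(map(int, match.groups())) if match else None
            self._chunks = []
            self._depth = 1

    def handle_endtag(self, tag):
        if self._depth:
            self._depth -= 1
            if self._depth == 0:
                text = "".join(self._chunks).strip()
                self.words.append((text, self._bbox))

    def handle_data(self, data):
        if self._depth:
            self._chunks.append(data)


def parse_words(hocr_text):
    """Return a list of (word_text, (x0, y0, x1, y1)) tuples."""
    parser = HocrWordParser()
    parser.feed(hocr_text)
    parser.close()
    return parser.words


# Made-up hOCR fragment for illustration only.
SAMPLE = """<div class='ocr_page' title='bbox 0 0 800 600'>
<span class='ocr_line' title='bbox 10 20 300 50'>
<span class='ocrx_word' title='bbox 10 20 120 50; x_wconf 93'>Hello</span>
<span class='ocrx_word' title='bbox 130 20 300 50; x_wconf 88'><strong>hOCR</strong></span>
</span></div>"""

if __name__ == "__main__":
    for text, bbox in parse_words(SAMPLE):
        print(text, bbox)
```

A real hOCR file may need a more forgiving parser; this sketch ignores page, paragraph, and line structure and only looks at word-level elements.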
-
Add download options if a canvas has associated seeAlso resources, e.g. hOCR.
Example manifest with hOCR:
https://api.digitale-sammlungen.de/iiif/presentation/v2/bsb11659582/manifest
```
"see…
-
Tesseract Version: v5.0.0-alpha.20190623
Platform: Windows 10 64-bit
Current Behavior: For the Thai language (almost) every individual character in hOCR output is a word
Expected Behavior: Words (…
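
One way to quantify the symptom described above is to measure what share of the recognized "words" consist of a single base character. The sketch below is only a diagnostic idea, not part of Tesseract: `single_char_ratio` is a hypothetical helper, the word lists are made up, and it assumes the word strings have already been extracted from the hOCR output. Combining marks (Unicode categories starting with `M`) are not counted, so a Thai consonant with a vowel sign and tone mark still counts as one character.

```python
import unicodedata


def base_length(word):
    """Count characters in `word`, ignoring combining marks."""
    return sum(
        1 for ch in word if not unicodedata.category(ch).startswith("M")
    )


def single_char_ratio(words):
    """Return the fraction of non-empty words made of one base character.

    Returns 0.0 for an empty list so callers need no special case.
    """
    words = [w for w in words if w]
    if not words:
        return 0.0
    singles = sum(1 for w in words if base_length(w) == 1)
    return singles / len(words)


if __name__ == "__main__":
    # Hypothetical output where every character became its own word.
    per_character = list("ภาษาไทย")
    # Hypothetical output with word-level segmentation.
    per_word = ["ภาษา", "ไทย"]
    print(single_char_ratio(per_character))
    print(single_char_ratio(per_word))
```

A ratio close to 1 on running text would match the reported behavior, while properly segmented output should give a much lower value.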
-
xUNC seems to miss the last wavelength here:
https://github.com/nasa/HyperCP/blob/8ed6754e0949641121e19bb24a413fb8e921c742/Source/ProcessL2.py#L522-L538
Comments in the code indicate that this err…
-
```
Thanks for developing OCRFeeder.
Do you have any plans to include "export to hOCR" functionality?
Together with hOCR export and automatically stitching the files back into the
recognized PDF do…
-
Hi there,
I'd really like it if there were a feature to use already existing hOCR/HTML data from Tesseract. This would allow running Tesseract separately (for whatever reason) and reusing the output for djvubind …
-
See #11 related to this and all the work that Giancarlo has been doing in the last week
The plan
1. Open an issue for all this ✔️
2. Code an endpoint with configuration (because I need to know w…