-
These are commented as `FIXME` at the end of `hocr-check`, I'll put them here for discussion.
- [ ] containment of paragraphs, columns, etc.
- [ ] ocr-recognized vs. actual tags
- [ ] warn about text …
-
I have set the `output_type` to `hocr`.
But where can I find it? I would expect the output to be stored somewhere. I read in the Tesseract documentation it is possible.
-
First off, thanks for an awesome piece of software. For the most part, it works great!
For some reason, after converting many thousands of pages, I've come across this error for one page only:
g…
-
```
What steps will reproduce the problem?
1. Use an input image at a rotation (e.g., 90 degrees)
2. Output the text to hocr
What is the expected output? What do you see instead?
The hocr output shou…
-
```
What steps will reproduce the problem?
Run tesseract 000000.tif 000000 -l pol+deu-frak hocr
on http://fleksem.klf.uw.edu.pl/~jsbien/tesseract_empty-words/000000.tif
What is the expected output?…
-
```
What steps will reproduce the problem?
1.Run tesseract with tessedit_create_hocr 1
2.Run tesseract with both tessedit_create_hocr 1 and hocr_font_info 1
3. Compare the 'ocr-capabilities list
…
-
```
What steps will reproduce the problem?
1.Run tesseract with tessedit_create_hocr 1
2.Run tesseract with both tessedit_create_hocr 1 and hocr_font_info 1
3. Compare the 'ocr-capabilities list
…
-
```
What steps will reproduce the problem?
1. Use an input image at a rotation (e.g., 90 degrees)
2. Output the text to hocr
What is the expected output? What do you see instead?
The hocr output shou…
-
```
What steps will reproduce the problem?
Run tesseract 000000.tif 000000 -l pol+deu-frak hocr
on http://fleksem.klf.uw.edu.pl/~jsbien/tesseract_empty-words/000000.tif
What is the expected output?…
-
```
What steps will reproduce the problem?
1. Use an input image at a rotation (e.g., 90 degrees)
2. Output the text to hocr
What is the expected output? What do you see instead?
The hocr output shou…