-
**What steps does it take to reproduce the issue?**
* When does this issue occur?
When trying to set up an action to extract hocr
* Which page does it occur on?
http://localhost:8000/admin/con…
-
Using `reportlab>=4.1.0` breaks the `hocr_pdf` tool, as the text layer is not being generated any more.
-
ANTs had a report of ImageMath crashing within a fMRIprep container: https://github.com/ANTsX/ANTs/issues/1204
It was narrowed down to likely the container ANTs was built with hardware support the …
-
### Environment
* **Tesseract Version**: 4.0.0 ~~4.0.0-beta.1 from https://packages.debian.org/stretch-backports/tesseract-ocr~~
* **Commit Number**: 51316994ccae0b48692d547030f26c0969308214 ~~c3e…
nezda updated
2 years ago
-
**Overview of feature request**
Is Features the right solution for Islandora Core Feature?
Do the affordances of Features (namely: updates) fit with how we are using this module in practice? D…
-
I said that I'm using Podman quadlet in #1068.
For this issue, I used this .container file
```
[Container]
Image=docker.io/frooodle/s-pdf:latest
AutoUpdate=registry
PublishPort=8080:8080
Vo…
-
This problem occurs with 11 out of a set of ~646 PNGs, all of which plopped out of the exact same processing pipeline, scanned on exactly the same hardware.
Both models (seg & rec) trained from bin…
-
### Describe the bug
So i got 'lots of diacritics - possibly poor OCR', i ran the output pdf and tried selecting text, some text weren't being selected. So i tried using tesseract on them `grimblas…
-
Tesseract has always included its own, internal binarization – which is **not** based on Leptonica and is of rather bad quality (custom global Otsu implementation without normalization). Leptonica doe…
-
I updated to the latest version of Workbench. My CSV includes 4 columns for additional media files, called PDF, mediatrack, HOCR, and transcript. These are defined in my YML with their Media Use term …
dara2 updated
7 months ago