-
In connection with the update of the Standard for the Digitisation of Monographs, we are also preparing a possible update of the ALTO format from the older version 2.0 to the latest version 4.2. We believe that the transition to the latest…
-
:warning: crazy idea :warning:
At https://beehind.org, we have an illustration in which an image is connected to text boxes. The illustration points to parts of the image and associates them with something e…
-
# What is this?
Look kids: https://mirador-textoverlay.netlify.app
And the source code: https://github.com/dbmdz/mirador-textoverlay
This is a custom Mirador 3 plugin built by Wizard and Techno con…
-
"Since we would like to keep paragraphs but get rid of line breaks, we need to find a way of identifying paragraphs which is not possible from the text itself. One approach was to annotate paragraphs …
-
1) Add a period at the end of "Aspyre GUI is a simple application to make Aspyre GT available as a service online"
2) Change "compatible for import" to "compatible with importation" in the following …
-
Hi!
I tried to run something like
```
java -cp ocrevaluation.jar eu.digitisation.Main \
-gt {ground_truth_file} [{encoding}] \
-ocr {ocr_file} [encoding] \
-d {output_directory} [-r…
-
Hello
I'm trying to use the 'mirador-textoverlay' plugin
to integrate ALTO files into Mirador, so that a transcription of the image can be viewed.
It seems to me that there is a problem with Hebrew supp…
-
# Description
Some use cases need to get access to information stored in the OCR format:
- OCR correction scenario
- access to word confidence (see Issue #68)
- access to other kinds of informati…
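
To make the word-confidence use case concrete, here is a minimal sketch of how it could be read from an ALTO file. It assumes the OCR format is ALTO and that confidence is stored in the `WC` attribute of `String` elements, as the ALTO schema defines it. The sample document, the function names, and the `0.5` threshold are hypothetical and only illustrate the idea; this is not how any particular viewer or plugin does it.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample; real ALTO files carry much more structure and metadata.
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v4#">
  <Layout>
    <Page>
      <PrintSpace>
        <TextBlock>
          <TextLine>
            <String CONTENT="Hello" WC="0.98" HPOS="10" VPOS="20" WIDTH="50" HEIGHT="12"/>
            <SP/>
            <String CONTENT="wrld" WC="0.41" HPOS="65" VPOS="20" WIDTH="40" HEIGHT="12"/>
          </TextLine>
        </TextBlock>
      </PrintSpace>
    </Page>
  </Layout>
</alto>"""


def local_name(tag):
    """Strip the '{namespace}' prefix so the same code works across ALTO versions."""
    return tag.rsplit("}", 1)[-1]


def word_confidences(alto_xml):
    """Return (word, confidence) pairs; confidence is None when WC is absent."""
    root = ET.fromstring(alto_xml)
    words = []
    for element in root.iter():
        if local_name(element.tag) == "String":
            wc = element.get("WC")
            confidence = float(wc) if wc is not None else None
            words.append((element.get("CONTENT", ""), confidence))
    return words


def low_confidence_words(alto_xml, threshold=0.5):
    """Return words whose confidence is known and below the (hypothetical) threshold."""
    return [
        word
        for word, confidence in word_confidences(alto_xml)
        if confidence is not None and confidence < threshold
    ]


if __name__ == "__main__":
    for word, confidence in word_confidences(SAMPLE_ALTO):
        print(word, confidence)
```

Matching on the local tag name rather than a fixed namespace is a deliberate choice here, since the namespace URI differs between ALTO versions. An OCR correction workflow could use something like `low_confidence_words` to decide which words to highlight for review.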
-
### Environment
* Python version: 3.11
* Nautobot version: 2.3.1
* nautobot-golden-config version: 2.1.1
I use the XML format for Palo Alto compliance checks.
Current Config and intended config…
-
### Your Feature Request
It might be relatively simple to do this by looking at the hocrrenderer https://github.com/tesseract-ocr/tesseract/blob/5f297dc0b8b500d57b7c073f4457e74ee537819f/src/api/hocr…