-
While running the FSCrawler o via docker compose, I face this error.
`2024-03-25 17:47:19 21:47:19,717 ERROR [f.p.e.c.f.c.FsCrawlerCli] job [doc_idx] does not exist. Exiting as we are in silent mode …
-
Hi @NielsRogge,
in your notebook [Fine_tune_LiLT_on_a_custom_dataset%2C_in_any_language.ipynb](https://github.com/NielsRogge/Transformers-Tutorials/blob/master/LiLT/Fine_tune_LiLT_on_a_custom_datas…
piegu updated
2 months ago
-
Scientific papers often share phylogenetic trees as images, which makes them hard to use without a conversion to a machine readable format like Newick.
I did a brief experiment to see if AI engines…
-
```
What steps will reproduce the problem?
1. Tesseract 3.02+ command line
2. "tesseract -l eng Image_crop.png Image pdf"
What is the expected output? What do you see instead?
> I expect tesseract…
-
```
What steps will reproduce the problem?
1. Tesseract 3.02+ command line
2. "tesseract -l eng Image_crop.png Image pdf"
What is the expected output? What do you see instead?
> I expect tesseract…
-
Checkout this project. it should be modular enough to just plug in https://github.com/bandrel/OCyara
-
Currently, we only specify how to describe the hierarchy of pages (represented by a set of files under `mets:structMap/mets:div/mets:div`) and their order. But nothing so far on logical structure **ac…
-
METS/PAGE/ALTO provided by digitization workflow software or repositories will not always adhere [to the conventions we have in OCR-D](https://ocr-d.de/en/spec). OTOH the workspaces that are the resul…
-
### Connector Name
destination-kafka
### Connector Version
0.1.10
### What step the error happened?
Other
### Relevant information
Hi Community,
First of all thank you very much …
-
```
What steps will reproduce the problem?
1. Tesseract 3.02+ command line
2. "tesseract -l eng Image_crop.png Image pdf"
What is the expected output? What do you see instead?
> I expect tesseract…