-
I tried to apply the suggested [workflow for slower processors](https://ocr-d.de/en/workflows#good-results-for-slower-processors). It failed in the last step:
```
20:42:49.364 INFO ocrd.task_sequenc…
-
I'm fired up about a rust implemented document parsing / embedding engine for my code and documents. Sadly, I don't see a good PDF ingestion in the code.
Ideally, I'd like to import PDFs from acad…
-
I'm are using a custom Layout Parser model, which is registered and has text, title, table. ... as categories.
I am trying to use pdfplumber detector and textextractionservice.
Code :
```
…
-
Currently we are using [trocr-base-printed](https://huggingface.co/microsoft/trocr-base-printed). Identify 3 other models and evaluate our OCR pipeline using these models. The model can be changed in…
-
Dear Development team,
Locking trought your code i've noticed that there is not an option to segment not only the structures but also the relevant ID that is often present in many patent (more or l…
-
Thanks for pointing out the perf impact by OCR on LiLT in your repo https://github.com/NielsRogge/Transformers-Tutorials/tree/master/LiLT where you mentioned " Please always use an OCR engine that can…
-
I have found an issue for a specific page but I am not sure what exactly the problem is other than that the page seems empty:
```c
20:43:59.071 DEBUG ocrd.processor.helpers.run_processor - Running…
-
I was testing a `streamlit` simple inference for `kraken`, segmentation -> recognition, for fast visual checking some models.
I get the segmentation "regions" from `baseline_seg = blla.segment(image,…
-
I have got the following exception (with loglevel 'trace') using `ocrd-cis-ocropy-resegment`with method 'ccomps':
```
15:16:17.857 INFO processor.OcropyResegment - Page "OCR-D-REG-DESKEW-4074_007817…
-
In previous work, we have seen how many segments in the record have either been split up incorrectly or merged incorrectly by the OCR engine. We should train a BERT model to split and merge these segm…