-
The function `serialization.serialize_segmentation` is giving me a strange output. In the tag declaration at the top of the Alto XML document, I get duplicated `OtherTag` elements with weird labels. M…
-
ocrd-anybaseocr-dewarp with the command:
`ocrd-anybaseocr-dewarp -I OCR-D-DESKEW-PAGE -O OCR-D-DEWARP-PAGE -p '{\"pix2pixHD\":\"../ocr_all/pix2pixHD/\",\"latest_net_G.pth\":\"../ocr_all/pix2pixHD/m…
-
The processor 'segment-repir' ends wirh Exception "Exception: ocrd-segment-repair exited with non-zero return value 1" if it comes after processor 'cis-ocropy-segment' in the workflow. In a changed wo…
-
Currently, `ocrd-eynollah-segment` produces (empty) `TextEquiv` elements. I believe it should not produce any, as this results in OCR processors giving a lot of warnings:
```
22:28:22.628 WARNING …
-
Write a method for going from document text to predicted document segment classes. This should include:
* a method for extracting text (embedded text or OCR)
* a model
-
I am using gunicorn + Flask + Docker with async gevent workers. My application receives a request with a json body that contains a base64 encoded image. I then load this json, extract the image using …
-
From discussion on OCR-D/core#418:
> Additionally, IMO the coordinate checks should be made a little less strict (and thus more compatible with Aletheia) to avoid crying wolf.
>
> Things I see …
-
```
2022-07-27 14:30:11.211885: E tensorflow/stream_executor/cuda/cuda_driver.cc:271] failed call to cuInit: CUDA_ERROR_NO_DEVICE: no CUDA-capable device is detected
2022-07-27 14:30:11.211926: I te…
-
Currently, `ocrd_browser.view.page` tries to add an image version for each `AlternativeImage` referenced in the page. But that can lead to an uncaught `FileNotFoundError` during https://github.com/hne…
-
When I run the following command:
```bash
kraken -i OCR17plus/Data/Balzac1624_Lettres_btv1b86262420_corrected/png/Balzac1624_Lettres_btv1b86262420_corrected_0042.png results.txt segment -bl -i OCR17…