-
The `hocr` files are already `html` files and can be displayed in any browser. However, they will just display the text without any layout or format information. What do you think about doing some HTM…
-
The current implementation of the `--resolve-dir` option prefixes an image path passed on the command line unconditionally:
https://github.com/PRImA-Research-Lab/prima-page-viewer/blob/ad75ac2dc1b8…
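A conditional variant of that prefixing could look roughly like the sketch below. It is written in Python purely for illustration (the viewer itself is Java), the function name `resolve_image_path` is hypothetical, and it assumes that absolute paths should be left untouched, which may not match the behavior actually wanted here.

```python
from pathlib import PurePosixPath
from typing import Optional


def resolve_image_path(image_path: str, resolve_dir: Optional[str]) -> str:
    """Prefix image_path with resolve_dir only if image_path is relative.

    Hypothetical helper: assumes POSIX-style paths and that absolute
    paths must never be prefixed.
    """
    if not resolve_dir:
        # No resolve directory given: keep the path as passed.
        return image_path
    path = PurePosixPath(image_path)
    if path.is_absolute():
        # Already absolute: prefixing would produce a wrong location.
        return image_path
    return str(PurePosixPath(resolve_dir) / path)
```

With this logic a relative path such as `img/a.tif` is joined onto the resolve directory, while `/abs/a.tif` is passed through unchanged.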
-
PAGE-XML allows arbitrary recursion of regions. This is not always required or useful, but there are a number of places where a mild form of recursion is unavoidable (cf https://github.com/OCR-D/spec/…
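To make the nesting concrete, here is a minimal sketch of a recursive walk over regions. It assumes that every region element has a local name ending in `Region`; the sample document, its namespace URI and the helper names are made up for illustration and not taken from any real file.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def walk_regions(parent, depth=0):
    """Yield (depth, element name, id) for each nested region, depth-first."""
    for child in parent:
        name = local_name(child.tag)
        if name.endswith("Region"):
            yield depth, name, child.get("id")
            # Regions may contain further regions, so recurse.
            yield from walk_regions(child, depth + 1)


# Illustrative sample only; the namespace URI is a placeholder.
SAMPLE = """<PcGts xmlns="http://example.org/page">
  <Page>
    <TableRegion id="t1">
      <TextRegion id="t1_c1">
        <TextRegion id="t1_c1_p1"/>
      </TextRegion>
    </TableRegion>
    <TextRegion id="r2"/>
  </Page>
</PcGts>"""

root = ET.fromstring(SAMPLE)
page = next(el for el in root if local_name(el.tag) == "Page")
regions = list(walk_regions(page))
for depth, name, region_id in regions:
    print("  " * depth + f"{name} {region_id}")
```

A consumer that only handles one level of regions would see `t1` and `r2` here and miss the two nested text regions.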
-
Not sure if this is a bug at all. I've used your pretrained BBZ model to segment pages in similar data: [`Börsenblatt des Deutschen Buchhandels`](http://digital.slub-dresden.de/id39946221X-18560530). The…
-
Hi Yury,
This is an amazing project! I've seen many examples, but have found yours to be very helpful! However, I was wondering if we could have a pagination option instead of viewing all the page…
-
It would be awesome if some or all models used throughout eynollah's workflow could be adapted to other domains by providing the tools for training. Ideally this would be complemented with some docume…
-
Hello there,
I have set up radian to run R code - and I did so successfully as R generally works in VSCode. However, fairly regularly some code snippets do not get executed when I hit 'control ente…
-
The current implementation extracts the ReadingOrder from the top-level parents of all `WORD` blocks (in the order of these word blocks). This seems to be necessary for cases with `TABLE` results.
…
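The approach described above can be sketched as follows. The block shape (dictionaries with `Id`, `BlockType` and `CHILD` entries under `Relationships`) is an assumption for illustration, as is the function name. The sketch also assumes that each block has at most one parent and that no page-level container block is in the list, since such a block would otherwise become the only top-level parent.

```python
def reading_order_from_words(blocks):
    """Return top-level parent ids in the order their WORD blocks appear."""
    # Map each child id to its parent id (assumes a single parent per block).
    parent_of = {}
    for block in blocks:
        for rel in block.get("Relationships", []):
            if rel["Type"] == "CHILD":
                for child_id in rel["Ids"]:
                    parent_of[child_id] = block["Id"]

    order = []
    seen = set()
    for block in blocks:
        if block["BlockType"] != "WORD":
            continue
        # Climb from the word to the block that has no parent.
        top = block["Id"]
        while top in parent_of:
            top = parent_of[top]
        if top not in seen:
            seen.add(top)
            order.append(top)
    return order


# Illustrative data: one word sits inside a table cell, two inside a line.
blocks = [
    {"Id": "line1", "BlockType": "LINE",
     "Relationships": [{"Type": "CHILD", "Ids": ["w1", "w2"]}]},
    {"Id": "table1", "BlockType": "TABLE",
     "Relationships": [{"Type": "CHILD", "Ids": ["cell1"]}]},
    {"Id": "cell1", "BlockType": "CELL",
     "Relationships": [{"Type": "CHILD", "Ids": ["w3"]}]},
    {"Id": "w3", "BlockType": "WORD"},
    {"Id": "w1", "BlockType": "WORD"},
    {"Id": "w2", "BlockType": "WORD"},
]
print(reading_order_from_words(blocks))
```

Because the order follows the word blocks rather than the container blocks, the table comes first in this example even though the line is listed before it.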
-
It was [mentioned before](https://github.com/UB-Mannheim/ocr-fileformat/issues/121#issuecomment-579218186) but @cneud just reminded me of https://github.com/PRImA-Research-Lab/cloud-vision-ocr-to-page…
-
I'm currently testing pdf.js and stuck with the components examples such as the [simpleviewer](https://github.com/mozilla/pdf.js/blob/master/examples/components/simpleviewer.html). These work great wi…