How to use the generated PAGE-XML as input to TrOCR?

Sorry for the late reply!

Here is some background info on the PAGE-XML format:

https://github.com/PRImA-Research-Lab/PAGE-XML - repo with schema, some docs and examples
https://ocr-d.de/en/gt-guidelines/trans/trPage.html - explore the schema
https://primaresearch.org/publications/ICPR2010_Pletschacher_PAGE - the original publication introducing the format
https://github.com/PRImA-Research-Lab/prima-core-libs - Java library for working with PAGE-XML from its developers
https://github.com/OCR-D/core/tree/master/ocrd_models - Python helpers for working with PAGE-XML (in OCR-D context)

More specifically about your question, I am not too familiar yet with TrOCR, but I assume you would have to extract the text lines (TextLine) with their bounding boxes/polygons (Coords) from the PAGE-XML output of Eynollah to derive text line images and feed the according snippets to TrOCR for text recognition/prediction.

qurator-spk / eynollah

How to use the generated PAGE-XML as input to TrOCR? #58