OCR-D / zenhub

Repo for developing zenhub integration
Apache License 2.0
0 stars 0 forks source link

Importing Transkribus PAGE-XML #17

Open krvoigt opened 2 years ago

krvoigt commented 2 years ago

Current situation

Users cannot readily use the PAGE-XML results of Transkribus in an OCR-D environment, because Transkribus' flavor of PAGE-XML is based on the older 2013 variant and contains proprietary constructs and extensions.

How it should be

Users should be able to use Transkribus, e.g. for segmentation, and subsequently OCR-D for e.g. text recognition.

Steps

krvoigt commented 2 years ago

@kba does this belong to the MVP?