qurator-spk / dinglehopper

An OCR evaluation tool
Apache License 2.0
58 stars 12 forks source link

Add TEI support #115

Open mikegerber opened 3 months ago

mikegerber commented 3 months ago

Motivated by some experiments with the corpus of Deutsches Textarchiv, it would be convenient if we could read TEI.

mikegerber commented 1 month ago

I have some code for this, but it still needs more work (bugs + not elegant)