impresso / impresso-text-acquisition

🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
https://impresso.github.io/impresso-text-acquisition/
GNU Affero General Public License v3.0
7 stars 2 forks source link

Disentangle the generic tetml importer and the fedgaz importer #94

Closed aflueckiger closed 4 years ago

aflueckiger commented 4 years ago

The code is documented.