AjaxMultiCommentary / ajmc-pipeline

Codebase for AjaxMultiCommentary
https://ajaxmulticommentary.github.io/ajmc-pipeline/
GNU Affero General Public License v3.0
4 stars 0 forks source link

add possibility to export a commentary page to ALTO/XML #1

Closed mromanello closed 2 years ago

mromanello commented 2 years ago

This concerns the class text_importation.classes.Page. Similarly to the method Page.to_json(), add a Page.to_alto() method.

For now: export only OLR and not OCR (but this could be done later).

See https://github.com/FoNDUE-HTR/FONDUE-FR-PRINT-17/tree/master/data/boileau_1674_alto_202203081708 for an example of Alto/XML data that use SegmOnto classes.

To be tested: import Alto/XML + image files (with SegmOnto subtype labels for regions) and see how they are rendered in the interface.