proycon / foliatools

A number of command-line tools for working with FoLiA (Format for Linguistic Annotation). Includes validators, converters, visualisers, and more.
GNU General Public License v3.0

FoLiA to STAM conversion #53

Closed proycon closed 7 months ago

proycon commented 1 year ago

Implement a folia2stam tool to export FoLiA documents to STAM. This is also relevant for proycon/folia#102, as STAM will act as the pivot model for conversion to Web Annotation.

This tool will essentially split the text from the annotations (which we sometimes refer to as 'untangling'). The main challenge is getting the offsets right, but we already have FoLiA tooling that should help there.
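To illustrate what 'untangling' means, here is a minimal sketch in Python. It is not the folia2stam implementation and uses neither the FoLiA nor the STAM library: the input is a toy inline-annotated XML snippet (not valid FoLiA), and the `untangle` function and the dictionary layout of the stand-off records are hypothetical names chosen for this example. It only shows the general idea of walking the tree, accumulating the plain text, and recording character offsets for each element.

```python
import xml.etree.ElementTree as ET


def untangle(xml_string):
    """Split inline-annotated XML into plain text and stand-off annotations.

    Hypothetical helper for illustration only; real FoLiA documents need
    FoLiA-aware handling of text content, spacing and annotation layers.
    """
    root = ET.fromstring(xml_string)
    chunks = []       # text fragments in document order
    annotations = []  # stand-off records pointing into the joined text
    pos = 0           # running character offset into the plain text

    def visit(elem):
        nonlocal pos
        begin = pos
        if elem.text:
            chunks.append(elem.text)
            pos += len(elem.text)
        for child in elem:
            visit(child)
            # Text following a child belongs to the parent, not the child.
            if child.tail:
                chunks.append(child.tail)
                pos += len(child.tail)
        annotations.append(
            {"type": elem.tag, "attrs": dict(elem.attrib), "begin": begin, "end": pos}
        )

    visit(root)
    return "".join(chunks), annotations


# Toy input: NOT real FoLiA, just inline markup around text.
toy = '<s><w pos="DET">The</w> <w pos="NOUN">cat</w> <w pos="VERB">sleeps</w></s>'
text, annotations = untangle(toy)

for a in annotations:
    # Each record selects its text purely via offsets into the plain text.
    print(a["type"], a["attrs"], repr(text[a["begin"]:a["end"]]))
```

The offsets are end-exclusive, so `text[begin:end]` recovers the annotated span. The hard part in practice is that the plain text has to be derived consistently from the document, which is where the existing FoLiA tooling comes in.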

The FoLiA-specific vocabulary will be maintained; this also relates to proycon/folia#4.

proycon commented 7 months ago

A preliminary version has been implemented. It works decently, but it still needs to be documented and released.