How to save spelling variant annotations in STAM?

annotation / stam

Stand-off Text Annotation Model (STAM) is a data model for stand-off text annotation where any information about a text is represented as an annotation. This repository contains the model's full specification, extensions, schemas, examples and documentation.

Creative Commons Attribution Share Alike 4.0 International

STAM is very flexible about how you model your data, so there is no single correct answer to this.

I'm not sure I'm interpreting your use case correctly, but you could create an annotation data set named spelling_variants with the keys variant_text and edition (or variant_source?), and then make three annotations with a Text Selector on the same text span, each carrying annotation data such as variant_text = party, edition = LE, and so on. That way you can add spelling variants from new editions independently as they become available, and it is fairly easy to query the variants for a specific edition.
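To make the suggested layout concrete, here is a minimal sketch in plain Python. It deliberately does not use the stam library, so none of the names below are STAM API calls; it only mirrors the shape of the data. The sample sentence, the offsets, and the edition labels ED2 and ED3 are made up for illustration; only the set name spelling_variants, the keys variant_text and edition, and the values party / LE come from the suggestion above.

```python
from dataclasses import dataclass

# Name of the annotation data set suggested above (used here only as a label).
DATASET = "spelling_variants"


@dataclass(frozen=True)
class VariantAnnotation:
    """One annotation: a text span plus the two suggested keys."""
    begin: int          # start offset of the span (inclusive)
    end: int            # end offset of the span (exclusive)
    variant_text: str   # spelling found in the given edition
    edition: str        # which edition the spelling comes from


# Hypothetical base text; text[4:10] is the word being annotated.
text = "The partie began at noon."
begin, end = 4, 10

# Three independent annotations on the same span, one per edition.
# "ED2" and "ED3" are hypothetical edition labels.
annotations = [
    VariantAnnotation(begin, end, "party", "LE"),
    VariantAnnotation(begin, end, "partye", "ED2"),
    VariantAnnotation(begin, end, "partie", "ED3"),
]


def variants_for_edition(annotations, edition):
    """Return (begin, end, variant_text) for every variant in one edition."""
    return [(a.begin, a.end, a.variant_text)
            for a in annotations if a.edition == edition]


def variants_at(annotations, begin, end):
    """Return a mapping edition -> variant_text for one exact span."""
    return {a.edition: a.variant_text
            for a in annotations if (a.begin, a.end) == (begin, end)}


if __name__ == "__main__":
    print(variants_for_edition(annotations, "LE"))
    print(variants_at(annotations, begin, end))
```

Because each variant is its own annotation, adding a new edition later means appending one more record rather than editing existing ones, which is the property the suggestion relies on.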

If you're unsure how to create the data using Python and the stam library, see the section "Creating an annotation dataset (vocabulary)" in the tutorial notebook: https://nbviewer.org/github/annotation/stam-python/blob/master/tutorial.ipynb

How to save spelling variant annotations in STAM? #27