ld4lt / linguistic-annotation

Towards a consolidated LOD vocabulary for linguistic annotations
Other
15 stars 3 forks source link

Towards a consolidated LOD vocabulary for linguistic annotations

The representation of linguistic annotations on the web is important for (at least) two kinds of applications:

For representing linguistic annotations on the web, several vocabularies are currently in use. Most frequently used for linguistic annotation (in a LOD context) are

Another important family of standards is being developed by ISO (Linguistic Annotation Framework, LAF). This provides a general conceptual model that can be replicated in RDF.

Full overview on relevant LOD vocabularies and their use now under https://github.com/ld4lt/linguistic-annotation/tree/master/survey

Based on a survey conducted in 2019, NIF and Web Annotation are being actively used in both academia and industry, but issues exist with respect to interoperability and expressivity. Web Annotation is a W3C recommendation and thus stable. NIF 2.0 is a stable vocabulary, as well, and referred to in W3C standards (ITS), but its development is coordinated by a single institution. More recent NIF extensions (NIF 2.1 additions for provenance) seem to be partially documented only (there is no complete definition for NIF 2.1), and updates seem to have ceased since 2016.

Approach

Anti-pattern: The anti-pattern (https://xkcd.com/927/, CC-BY-NC 2.5)