Open mabablue opened 4 years ago
Notes transferred from Maria's document: UC6 data annotation (Manual or automated process for annotation of column headers/fields and streams. Could be done in real- or near-real time when the data are generated and subsequently transformed or in delayed mode.):
Notes:
Requires concepts that have persistent resolvable URIs
Requires terminologies based on simple atomic terms
ADDED - Requires (agreement of) top-level, domain-independent categorization scheme/ontology
Requires that the relationships be trusted
Requires terminologies with coarse/fine granularity
Requires a long-term commitment governance setup
Requires an active community supporting the terminology
Requires reliable technical infrastructure
Requires input from domain experts
Requires that the terminology be part of federated community specific and/or cross-domain portals
Requires that the terminology supports multilingual terms
Requires multilingual editorial team or multilingual community effort
Requires terminologies published as linked data capabilities
Requires the terminology to use a common minimum metadata schema to describe semantic artefacts and their content
Requires mappings between terminologies
Summary from Maria's input The following requirements are identified for manual annotation:
For automated annotation:
should we make two use cases - manual and automated data annotation?
[automated message] Updated top entry of this issue on 2020-07-11
[automated message] Updated top entry of this issue on 2020-09-21
Data annotation
Manual process for annotation of column headers/fields and streams. Could be done in real- or near-real time when the data are generated and subsequently transformed or in delayed mode.
Corresponding user stories
Requirements identified in the spreadsheet (and open for discussion)
last updated: 2020-09-21