Open Arithmeticus opened 4 years ago
Absolutely. I extended https://github.com/ld4lt/linguistic-annotation/blob/master/survey/required-features.md#b16-provenance-and-confidence accordingly. As this requirement is documented now, shall we close the issue or leave it open for further discussions?
How deep the requirements are meant to go at this stage and to what extent will other ontologies be used? Provenance can quickly become an immense topic. For example, "tools involved" could be expanded to include algorithms used for tokenization, etc. But this is probably a place where it makes sense to support ontologies like Prov.
(in case it's helpful as a reference, this poster gives some examples of considerations for using Prov for linguistic annotations https://www.rd-alliance.org/sites/default/files/Krohnposter.pdf , linked from https://www.rd-alliance.org/groups/research-data-provenance/wiki/rdaus-scholars-program-internship-abstract-and-poster.html/ )
Ah, I misunderstood the meaning of "provenance." Is this nestable? X claims that Y claims that Z claims that...[linguistic annotation]
On Wed, Nov 25, 2020 at 12:31 PM Bridget Almas notifications@github.com wrote:
How deep the requirements are meant to go at this stage and to what extent will other ontologies be used? Provenance can quickly become an immense topic. For example, "tools involved" could be expanded to include algorithms used for tokenization, etc. But this is probably a place where it makes sense to support ontologies like Prov.
(in case it's helpful as a reference, this poster gives some examples of considerations for using Prov for linguistic annotations https://www.rd-alliance.org/sites/default/files/Krohnposter.pdf )
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/ld4lt/linguistic-annotation/issues/3#issuecomment-733847729, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAQD53SYZ2YTZVDV3KBR5OLSRU5OXANCNFSM4UCUDLLA .
-- Joel Kalvesmaki kalvesmaki.com
@balmas : Goal of requirement collection at the moment is to decide whether to start working with NIF or Web Annotation as a basis (starting spmething from scratch is probably not an option), and whether any of these need resivion or extension in that case and if so, how to organize the discussion of required features for developing a vocabulary for linguistic annotaitons. Not too deep thus ;)
@Arithmeticus : I guess the standard solution in an RDF context would nowadays be to resort to Prov-O (https://www.w3.org/TR/2013/NOTE-prov-primer-20130430/). Prov-O is not directly nestable (provenance is defined in terms of RDF properties only), but it is possible to use RDF reification.
In any case, we should formulate an explicit recommendation on how to model provenance and confidence, preferrably using external vocabularies if anything established can be found.
Confidence can become quite complex on its own: how it has been decided, tool/algorithm used to ascertain it, person who may have added it (if added by a person), etc. Do we want to go into such details?
@pennyl67 I think we can basically delegate this to Prov-O, but we should check that this really works.
I'll follow the lead of @balmas and suggest it important to be able to express what person(s) or organization(s) is (are) responsible asserting the claim behind a given linguistic annotation.