globalbioticinteractions / carvalheiro2023

GloBI configuration to help index Luisa Carvalheiro, José Augusto Salim, Filipi Soares, Debora Drucker. 2023. WorldFAIR pilot data from: VisitationData_Luisa_Carvalheiro.
0 stars 0 forks source link

Dataset semantic annotation (replacing column names by controlled vocab terms) #5

Open Filipi-Soares opened 9 months ago

Filipi-Soares commented 9 months ago

KENYA interactions_data_sheet(Apr 2023) Should we understand referenceDoi, referenceUrl, and referenceCitation as dct:identifier or dct:references?

jhpoelen commented 9 months ago

I'd say dct:references . . . and. . . I realize that the mapping may not be one-to-one. E.g., referenceDOI is a dct:references, but not the other way around.

jhpoelen commented 9 months ago

I'd suggest to use the term you are familiar with, instead of using the GloBI ones as a starting point.

jhpoelen commented 9 months ago

But perhaps @zedomel has some suggestions also.

Filipi-Soares commented 9 months ago

@jhpoelen, where can I find the list of bibliographic terms Globi uses? It’d be cool to start mapping these from the get-go. If Globi doesn’t cover everything, we could always mix in some terms from other vocabularies. That way, our dataset's description is on point, even if Globi can’t use all the extra metadata. See for instance this dataset from the USDA. I started mapping it to BIBO (Bibliographic Ontology), Schema.org and Dublin Core.

Filipi-Soares commented 9 months ago

Ops, just discovered the eml-literature.xsd module, I changing the mapping to EML. Having a lot of fun :smile:

Filipi-Soares commented 9 months ago

eml-2.1.1.json` is pretty handy to navigate EML tags @zedomel

jhpoelen commented 9 months ago

@jhpoelen, where can I find the list of bibliographic terms Globi uses? It’d be cool to start mapping these from the get-go. If Globi doesn’t cover everything, we could always mix in some terms from other vocabularies. That way, our dataset's description is on point, even if Globi can’t use all the extra metadata. See for instance this dataset from the USDA. I started mapping it to BIBO (Bibliographic Ontology), Schema.org and Dublin Core.

GloBI takes free form citation strings, dois and urls. For examples, see citations.tsv in https://globalbioticinteractions.org/data .

Note also https://github.com/globalbioticinteractions/globalbioticinteractions/issues/798 fyi @arw36

curious to hear your notes on how to use more structured bibliographic terms. I would welcome specific examples of current vs. expected .