SSHOC / vocabularies


object-format property and IANA-media-types vocab. #11

Closed dpancic closed 1 year ago

dpancic commented 3 years ago

In GitLab by @laureD19 on May 19, 2021, 13:16

The object-format property is a concept-based property that uses the IANA-media-types as concepts: https://www.iana.org/assignments/media-types/media-types.xhtml

for which we need to:

dpancic commented 3 years ago

In GitLab by @laureD19 on May 19, 2021, 13:16

added to epic &8

dpancic commented 3 years ago

In GitLab by @laureD19 on May 26, 2021, 14:49

@olanowak identified the list of values used for format during the HumanitiesData ingest (cf. BC message)

Based on this list, I have started a mapping between those formats and IANA-media-type concepts here. Some curation is still needed because many of the formats used in HumanitiesData are file extensions rather than media types, but this is a start and should allow us to add the HumanitiesData formats as SKOS altLabel values.
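To illustrate the idea of attaching HumanitiesData format strings to media-type concepts as SKOS altLabel values, here is a minimal sketch. It is not the actual mapping file: the sample labels in `FORMAT_TO_MEDIA_TYPE`, the `BASE` concept URI, and both function names are assumptions made up for the example.

```python
# Hypothetical sample: format labels as they might appear in the source data,
# mapped by hand to IANA media type names. Not the real mapping file.
FORMAT_TO_MEDIA_TYPE = {
    "pdf": "application/pdf",
    "PDF": "application/pdf",
    "xml": "application/xml",
    "csv": "text/csv",
}

# Assumed base URI for the concepts; the real vocabulary may use another one.
BASE = "https://www.iana.org/assignments/media-types/"


def alt_labels_by_concept(mapping):
    """Group the source format labels under the media type they map to."""
    grouped = {}
    for label, media_type in mapping.items():
        grouped.setdefault(media_type, set()).add(label)
    return grouped


def to_turtle(grouped):
    """Render one skos:Concept per media type, with its altLabel values."""
    lines = ["@prefix skos: <http://www.w3.org/2004/02/skos/core#> .", ""]
    for media_type in sorted(grouped):
        labels = ", ".join(f'"{label}"' for label in sorted(grouped[media_type]))
        lines.append(f"<{BASE}{media_type}> a skos:Concept ;")
        lines.append(f'    skos:prefLabel "{media_type}" ;')
        lines.append(f"    skos:altLabel {labels} .")
        lines.append("")
    return "\n".join(lines)


if __name__ == "__main__":
    print(to_turtle(alt_labels_by_concept(FORMAT_TO_MEDIA_TYPE)))
```

Grouping by media type first means that several source spellings (for example `pdf` and `PDF`) end up as altLabel values on a single concept instead of producing duplicate concepts.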

dpancic commented 3 years ago

In GitLab by @vronk on May 27, 2021, 15:49

@vronk Here is the current ttl file: https://gitlab.gwdg.de/sshoc/vocabularies/-/blob/feature/iana-media-types/sshoc-mediatypes/sshoc-media.ttl

Of course, Laure's work still needs to be incorporated into this via the script.

dpancic commented 3 years ago

In GitLab by @vronk on May 28, 2021, 13:50

Added the possibility to include the mapping between Laure's file and IANA in the ttl file: see 8c9ccca8247728c98d246d5104a17494353375da and the result at https://gitlab.gwdg.de/sshoc/vocabularies/-/blob/develop/sshoc-mediatypes/media-type.ttl#L881

Can be updated following steps in README.md

dpancic commented 3 years ago

In GitLab by @KlausIllmayer on Aug 2, 2021, 11:24

The vocabulary can be found here: https://gitlab.gwdg.de/sshoc/vocabularies/-/tree/master/iana-mediatypes

It is derived from the IANA website and adds references to HumanitiesData types, using a Python tool from Yoann that transforms the IANA data into SKOS. The SKOS is ingested into PoolParty, exported from there, and ingested into the MP, identified by code iana-media-typ
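As a rough sketch of the first step in that pipeline (registry data to SKOS), the snippet below converts a few registry rows into Turtle. It is not Yoann's tool. It assumes the registry is available as CSV with `Name`, `Template` and `Reference` columns, uses a small inline sample instead of downloading anything, and the `SCHEME` URI and function names are hypothetical.

```python
import csv
import io

# Inline sample in the assumed registry CSV layout (Name, Template, Reference).
SAMPLE_REGISTRY_CSV = """Name,Template,Reference
pdf,application/pdf,[RFC8118]
json,application/json,[RFC8259]
"""

# Hypothetical concept scheme URI, used only for this example.
SCHEME = "https://example.org/iana-media-types/"


def read_templates(csv_text):
    """Return the media type templates (e.g. 'application/pdf') from the CSV."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row["Template"] for row in reader if row["Template"]]


def concept_turtle(template):
    """Render a single media type as a skos:Concept in Turtle."""
    return (
        f"<{SCHEME}{template}> a skos:Concept ;\n"
        f'    skos:notation "{template}" ;\n'
        f'    skos:prefLabel "{template}"@en ;\n'
        f"    skos:inScheme <{SCHEME}> .\n"
    )


def registry_to_skos(csv_text):
    """Render the concept scheme followed by one concept per registry row."""
    header = (
        "@prefix skos: <http://www.w3.org/2004/02/skos/core#> .\n\n"
        f"<{SCHEME}> a skos:ConceptScheme .\n\n"
    )
    return header + "\n".join(concept_turtle(t) for t in read_templates(csv_text))


if __name__ == "__main__":
    print(registry_to_skos(SAMPLE_REGISTRY_CSV))
```

The resulting Turtle is the kind of file that could then be imported into a SKOS editor; the PoolParty import and export steps themselves are not covered here.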

dpancic commented 3 years ago

In GitLab by @vronk on Aug 4, 2021, 10:45

Note that the mapping between IANA and HumanitiesData is not finished, therefore the vocabulary is also not totally finished. I am not reopening the ticket since the vocabulary can be updated once the mapping is completed, see README.