cessda / cessda.cdc.versions

Issue tracker and wiki for the CESSDA Data Catalogue
https://datacatalogue.cessda.eu/
Apache License 2.0

Use monolingual profile for DANS endpoint #604

Open john-shepherdson opened 10 months ago

john-shepherdson commented 10 months ago

The DANS metadata is either Dutch or English, but not both in one record. They do not use language tags for individual metadata fields. As a result, validating against the multilingual profile generates many constraint violations relating to language tags.

matthew-morris-cessda commented 10 months ago

Is there documentation anywhere about how the overall language of a record is specified?

john-shepherdson commented 10 months ago

Will ask Ricarda

john-shepherdson commented 10 months ago

They don't specify the language at present. I have been in touch with the Service Owner for guidance on how to proceed (include/exclude DANS endpoint).

john-shepherdson commented 10 months ago

Ricarda,

I discussed the lack of separation of the English and Dutch language metadata with Kristina (in CC). She rightly pointed out that it would be a backward step and an unwanted precedent to knowingly include such mixed content. Therefore the recommendation is that we exclude the new DANS endpoint from the upcoming release of CDC 3.4.0.

Please continue your work on adding language tags to your OAI-PMH records and feel free to contact me at any point for further testing and feedback re metadata quality. Once the tagging and testing is complete, we should be able to add the DANS endpoint to CDC at relatively short notice.

Regards,

John

matthew-morris-cessda commented 10 months ago

This is no longer part of CDC 3.4.0

alen-vodopijevec-cessda commented 1 week ago

@RicardaBraukmann please check the following example for a valid language specification.

Language can be specified at the document level within the element, or at the individual element level within a specific tag.
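To illustrate the two options just described, here is a minimal sketch rather than an actual DANS record or the CDC validator's logic. It assumes the records follow DDI Codebook 2.5 (namespace `ddi:codebook:2_5`, root element `codeBook`, title elements `titl` and `parTitl`) and that language is carried by the standard `xml:lang` attribute. The sample record and the helper `effective_languages` are hypothetical and exist only for this example.

```python
import xml.etree.ElementTree as ET

# xml:lang belongs to the reserved XML namespace; ElementTree exposes it under this key.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Hypothetical record: the namespace and element names are assumed from DDI Codebook 2.5.
SAMPLE = """<codeBook xmlns="ddi:codebook:2_5" xml:lang="nl">
  <stdyDscr>
    <citation>
      <titlStmt>
        <titl>Voorbeeldstudie</titl>
        <parTitl xml:lang="en">Example study</parTitl>
      </titlStmt>
    </citation>
  </stdyDscr>
</codeBook>"""


def effective_languages(xml_text):
    """Return (local tag name, text, language in scope) for each element with text."""
    root = ET.fromstring(xml_text)
    results = []

    def walk(element, inherited):
        # An xml:lang on the element overrides the value inherited from its ancestors.
        lang = element.get(XML_LANG, inherited)
        text = (element.text or "").strip()
        if text:
            local_name = element.tag.split("}")[-1]
            results.append((local_name, text, lang))
        for child in element:
            walk(child, lang)

    walk(root, None)
    return results


for name, text, lang in effective_languages(SAMPLE):
    print(name, text, lang)
```

Under these assumptions, `titl` inherits `nl` from the document-level attribute on the root element, while `parTitl` reports `en` because its own tag overrides the inherited value. An element with text and no `xml:lang` anywhere in scope would come back with `None`, which is the kind of field a multilingual profile would be expected to flag.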

More examples can be found here

RicardaBraukmann commented 1 week ago

Thank you. I have asked my colleague Laura to have a look at this issue, as she knows more about how our metadata output and endpoint are currently organized.