clarin-eric / oai-harvest-manager

A simple Java application for managing an OAI-PMH harvesting workflow
14 stars 12 forks source link

Make sure that language is populated in the CMDI derived from OLAC metadata #3

Open twagoo opened 9 years ago

twagoo commented 9 years ago

See ticket #554 in CLARIN trac:

It matches the pattern /c:CMD/c:Components//c:OLAC-DcmiTerms/c:subject/@olac-language defined in facetConcepts.xml​. ... [..] behaviour of the importer is that these hard-coded patterns will be ignored if the CMDI profile contains at least one element that has a matching conceptlink. This is the case here ("OLAC-DcmiTerms/language", although empty). Therefore normally this attribute is not used to fill the language facet for this profile. ... [..] convince the creator to include values in the language element in the future. ... The direct creator of these CMDI's is the harvester, so maybe something can be done there, i.e. fill the 'language' field with the value from the attribute.

(Originally submitted as https://trac.mpi.nl/ticket/4271)

kjvandelooij commented 9 years ago

Original ticket has been closed.

twagoo commented 8 years ago

This issue still occurs, as is apparent from the values for the language facet in the recently added CoCoON collection. See CLARIN Trac #554 for a relevant discussion.

(It seems that the original ticket was only closed because it was migrated to GitHub)

twagoo commented 8 years ago

Notice that the case might be slightly different (from the one that was fixed primarily in CLARIN Trac #668 in that there actually is a value in these cases, e.g.:

<language olac-language="ita">Italien</language>
<language olac-language="deu">Allemand</language>
<language olac-language="eng">Anglais</language>

Maybe this is more of a mapping/curation issue. To be evaluated...