scriptotek / mc2skos

Command line script for converting Marc21 Classification and Authority records to SKOS/RDF
The Unlicense

Test with Regensburger Verbundklassifikation (RVK) data and add RVK examples #24

Closed: nichtich closed this issue 6 years ago

nichtich commented 7 years ago

Regensburger Verbundklassifikation (RVK) is available in MARCXML on request. The license and URI structure, however, are not clear. University Library Mannheim once had a prototype of RVK in RDF:

Phu2 commented 6 years ago

A tiny test with the first 3 records from the MARCXML dump: snippet.txt

The command

mc2skos -v snippet2.txt > snippet.txt.ttl

gives

[2018-01-08 23:47:53,491 INFO] Parsing: snippet2.txt
[2018-01-08 23:47:53,524 WARNING] Unknown value: n
[2018-01-08 23:47:53,524 DEBUG] n
[2018-01-08 23:47:53,525 DEBUG] Ignoring A because: not intended for display
[2018-01-08 23:47:53,525 WARNING] Unknown value: n
[2018-01-08 23:47:53,526 DEBUG] n
[2018-01-08 23:47:53,526 DEBUG] Ignoring AA because: not intended for display
[2018-01-08 23:47:53,526 WARNING] Unknown value: n
[2018-01-08 23:47:53,527 DEBUG] n
[2018-01-08 23:47:53,527 DEBUG] Ignoring AA 09900 because: not intended for display
[2018-01-08 23:47:53,527 WARNING] RDF result is empty!

It looks like the values in field 008 are not valid according to the specification; see positions 06 and 13. I sent them an email.
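To see which values a dump actually carries at those two positions, something like the following could be run over the MARCXML before converting. It is only a sketch: it assumes the records use the standard MARCXML namespace, the function name `inspect_008` is made up for illustration, and the sample record is invented rather than taken from the RVK dump. It does not reproduce how mc2skos itself validates field 008.

```python
import xml.etree.ElementTree as ET

# Standard MARCXML ("MARC21 slim") namespace.
MARC_NS = "{http://www.loc.gov/MARC21/slim}"


def inspect_008(marcxml):
    """Return one (position 06, position 13) tuple per 008 control field.

    A position is reported as None when the field is too short to contain it.
    """
    root = ET.fromstring(marcxml)
    results = []
    for record in root.iter(MARC_NS + "record"):
        for field in record.findall(MARC_NS + "controlfield"):
            if field.get("tag") != "008":
                continue
            value = field.text or ""
            pos06 = value[6] if len(value) > 6 else None
            pos13 = value[13] if len(value) > 13 else None
            results.append((pos06, pos13))
    return results


# Invented sample record (not real RVK data): 'n' at positions 06 and 13.
SAMPLE = """<collection xmlns="http://www.loc.gov/MARC21/slim">
  <record>
    <controlfield tag="008">180108n||||||n</controlfield>
  </record>
</collection>"""

print(inspect_008(SAMPLE))
```

Comparing the reported characters against the code lists in the MARC 21 Classification specification should show whether the "Unknown value" warnings come from the data or from the tool.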

Phu2 commented 6 years ago

I had an old version; the problem with the 008 field has been fixed in the current dump (2017_4).

nichtich commented 6 years ago

The URI format still has to be specified explicitly until there is an official one, but you can start with an arbitrary prefix such as http://example.org/rvk/. I'll add the official RVK URI format to mc2skos once it has been decided, so that no additional calling parameters will be needed.
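As a rough illustration of what a placeholder prefix gives you, the sketch below builds a URI from the prefix and a notation. The helper name `rvk_uri` and the choice to percent-encode the notation are assumptions made for this example only; they are neither the official RVK URI format nor necessarily what mc2skos produces.

```python
from urllib.parse import quote

# Placeholder prefix from the discussion; not an official RVK namespace.
DEFAULT_PREFIX = "http://example.org/rvk/"


def rvk_uri(notation, prefix=DEFAULT_PREFIX):
    """Append a percent-encoded notation to the prefix (hypothetical scheme)."""
    # RVK notations such as "AA 09900" contain a space, which must not
    # appear unescaped in a URI, so every reserved character is encoded.
    return prefix + quote(notation, safe="")


print(rvk_uri("AA 09900"))  # http://example.org/rvk/AA%2009900
```

Keeping the prefix as a single parameter means that only one value has to change once an official format is published.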