buda-base / xmltoldmigration

App to migrate from TBRC XML files to BDRC RDF LD
Apache License 2.0

lccn casing #80

Open eroux opened 5 years ago

eroux commented 5 years ago

Most of the time the lccn data needs to be upper-cased (hence the normalization in that direction in xmltold), but sometimes it doesn't... for instance

xristy commented 5 years ago

The format of LCC call numbers (note these are distinct from LCCNs) doesn't mention lower-case letters. I was able to find an example LCC call number that has a lower-case letter in the Cutter number: PZ7.J684 Wj 1982, but the rules are rather involved.

We could upcase the LCC call number up to the year. That involves a bit of parsing, and it would mess up the example above. But the search by numbers is case-insensitive, so perhaps it doesn't matter too much if some lower-case letters are erroneously upcased.
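A minimal sketch of the "upcase up to the year" idea, for illustration only. The function names and the year pattern are assumptions, not anything that exists in xmltold: here a "year" is taken to be the last standalone four-digit token starting with 1 or 2, optionally followed by a lower-case letter suffix. Real call numbers may need more careful parsing.

```python
import re

# Assumption: the publication year is a standalone four-digit token
# (1000-2999), optionally followed by a lower-case suffix such as "1982b".
# The lookarounds keep digits inside class or Cutter numbers from matching.
_YEAR = re.compile(r"(?<![0-9A-Za-z.])[12]\d{3}[a-z]?(?![0-9A-Za-z])")


def upcase_before_year(call_number: str) -> str:
    """Upper-case everything before the last year-like token.

    If no year-like token is found, the whole string is upper-cased.
    """
    matches = list(_YEAR.finditer(call_number))
    if not matches:
        return call_number.upper()
    start = matches[-1].start()
    return call_number[:start].upper() + call_number[start:]


def same_call_number(a: str, b: str) -> bool:
    """Case-insensitive comparison, mirroring a case-insensitive search."""
    return a.casefold() == b.casefold()
```

With this sketch, `upcase_before_year("PZ7.J684 Wj 1982")` loses the lower-case `j` in the Cutter number, which is the erroneous upcasing mentioned above. `same_call_number` still treats the original and the upcased form as equal, so a case-insensitive search would find either.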

eroux commented 5 years ago

I think the lead on syncing them automatically is probably our best shot; we just need to find a contact.

xristy commented 5 years ago

Up to you