gbv / k10plus-subjects

Subject analysis of records in K10plus catalogue
0 stars 0 forks source link

Include LCC in clean dumps #18

Closed nichtich closed 2 years ago

nichtich commented 2 years ago

Library of Congress Classification (LCC) is present in more then 10% of any record with subject indexing data in PICA+ field 045A. LCC has been mapped to RDF (see https://id.loc.gov/authorities/classification.html) but a notationPattern is needed to get clean notations. The current raw dump also contains notations such as MLCS 86/04751 (P) not looking like valid LCC notations.