NatLibFi / bib-rdf-pipeline

Scripts and configuration for converting MARC bibliographic records into RDF
Creative Commons Zero v1.0 Universal
29 stars 5 forks source link

Drop useless instances generated from 530 fields #82

Open osma opened 6 years ago

osma commented 6 years ago

For example, the work W00508353600 has four instances, two of which only have the note "Julkaistu myös painettuna". These are in principle links to other instances, but with so little information that it's practically impossible to actually make the link. Thus they are useless and should be dropped from the output.

osma commented 6 years ago

There are about 94k 530 notes. Of these, about 24k contain an ISBN. Maybe turning them (at least the ones without ISBN) into other types of notes (e.g. 500) would make sense.