hornc / plotMARC

Visually characterise a bibliographic collection in terms of publication dates and available bibliographic identifier coverage
GNU General Public License v3.0
3 stars 0 forks source link

Filter 035 fields for OCLC ids #8

Closed hornc closed 2 years ago

hornc commented 2 years ago

Currently the code is treating any MARC 035 field as an OCLC number, which is inaccurate.

Filter out 035s which do not contain OCLC numbers.

See https://github.com/internetarchive/openlibrary/blob/39962157c12465d050e43458444aba9a59661c8d/openlibrary/catalog/marc/parse.py Line 125 ~ for some regex ideas.