NYPL / catalog_of_copyright_entries_project

NYPL Project to transcribe and parse pages from the US Catalog of Copyright Entries
Creative Commons Zero v1.0 Universal
58 stars 13 forks source link

Entries with multiple registration numbers #1

Closed seanredmond closed 6 years ago

seanredmond commented 6 years ago

When an entry has multiple registration numbers DCL proposes to split them into several entries.

screen shot 2018-03-06 at 12 19 31 pm screen shot 2018-03-06 at 12 19 47 pm

seanredmond commented 6 years ago

We'll want to make sure we can properly group them together. Perhaps add a splitFrom attribute (better name?) to the copyrightEntry element that would be an IDREF to the id of the first in the group?

seanredmond commented 6 years ago

Looking at #5, if we keep the registrationNumber element maybe we also need a registrationNumberRange element? I would propose:

<registrationNumberRange>
    <registrationNumber>A 159923</registrationNumber>, <registrationNumber>159924</registrationNumber>
</registrationNumberRange>

<registrationNumberRange>
    <registrationNumber>A 160078</registrationNumber>-<registrationNumber>160078</registrationNumber>
</registrationNumberRange>

It would require something similar for the volumes. The first example is something different altogether.

seanredmond commented 6 years ago

This is partly fixed by #21, the first example completely when there's a "tail" of additional registrations. The other examples are trickier since they're embedded and all kind of mixed up.