NYPL / catalog_of_copyright_entries_project

NYPL Project to transcribe and parse pages from the US Catalog of Copyright Entries
Creative Commons Zero v1.0 Universal

Normalize regDate #6

Closed seanredmond closed 6 years ago

seanredmond commented 6 years ago

DCL proposes normalizing the registration date to YYYY-MM-DD


<regDate>1951-03-06</regDate>
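
As a rough illustration of the normalization being proposed, here is a minimal Python sketch that turns a raw date such as `6Mar51` into `1951-03-06`. The function name `normalize_reg_date`, the month table, and the `century` default are hypothetical, not part of the project. The sketch assumes three-letter English month abbreviations and that two-digit years fall in the 1900s; the real data may use other forms that would need extra handling.

```python
import re
from datetime import date

# Assumption: months appear as three-letter English abbreviations.
MONTHS = {
    "Jan": 1, "Feb": 2, "Mar": 3, "Apr": 4, "May": 5, "Jun": 6,
    "Jul": 7, "Aug": 8, "Sep": 9, "Oct": 10, "Nov": 11, "Dec": 12,
}

# Day (1-2 digits), month abbreviation, two-digit year, e.g. "6Mar51".
_RAW_DATE = re.compile(r"^(\d{1,2})([A-Za-z]{3})(\d{2})$")


def normalize_reg_date(raw, century=1900):
    """Return the date as YYYY-MM-DD, or None if it cannot be parsed.

    Assumption: two-digit years belong to the given century (1900s by default).
    """
    match = _RAW_DATE.match(raw.strip())
    if match is None:
        return None
    day, month_abbr, year = match.groups()
    month = MONTHS.get(month_abbr.capitalize())
    if month is None:
        return None
    try:
        return date(century + int(year), month, int(day)).isoformat()
    except ValueError:
        # Impossible calendar dates such as 31Feb51 are rejected.
        return None


print(normalize_reg_date("6Mar51"))  # 1951-03-06
```

Returning `None` for anything unparseable leaves it to the caller to decide whether to keep the original text or flag the entry for review.
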
seanredmond commented 6 years ago

I can see a couple of other options:

Make the normalized version an attribute of regDate

<regDate normalized="1951-03-06">6Mar51</regDate>

Or, taking a cue from #5, make the normalized data an attribute of copyrightEntry

<copyrightEntry id="GUID" regnum="A53538" regdate="1951-03-06">

and either keeping the regDate element for the unnormalized version or doing away with it entirely.

In my experience, attributes are easier to deal with in processing than tag contents, so making the normalized date an attribute would be a plus for later script writers.
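
To make the comparison concrete, here is a small sketch using Python's standard `xml.etree.ElementTree`. The sample entry is adapted from the snippets above, with the attribute spelled `normalized`; it is illustrative only, not the project's actual schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical entry combining the options discussed above.
entry = ET.fromstring(
    '<copyrightEntry id="GUID" regnum="A53538">'
    '<regDate normalized="1951-03-06">6Mar51</regDate>'
    '</copyrightEntry>'
)

reg_date = entry.find("regDate")

# Attribute access: a single lookup that returns None when the attribute is absent.
normalized = reg_date.get("normalized")

# Tag contents: .text can be None and may carry surrounding whitespace,
# so it usually needs a guard and a strip before use.
original = (reg_date.text or "").strip()

print(normalized, original)  # 1951-03-06 6Mar51
```

With this layout a script can read the normalized date from the attribute while the transcribed text stays available in the element body.
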

seanredmond commented 6 years ago

date attribute on various elements