Living-with-machines / lwmdb

A django-based library for managing the Living with Machines newspapers metadata database schema
https://living-with-machines.github.io/lwmdb/
MIT License
2 stars 0 forks source link

Database field names notes #93

Open mialondon opened 1 year ago

mialondon commented 1 year ago

Newspaper 'Publication code' - NLP

mialondon commented 1 year ago

What's the difference between 'Location' and 'Place of publication'? Is one distribution and the other nominated place of publication?

mialondon commented 1 year ago

In the Newspapers field: What are Issue code, Issue date and especially Input sub path?

kallewesterling commented 1 year ago

Newspaper 'Publication code' - NLP

The million-dollar question... I'm not sure anyone actually knows the origin of the NLP? But this seven-digit number is how we have navigated newspaper collections across the project all the time I've been here.

What's the difference between 'Location' and 'Place of publication'? Is one distribution and the other nominated place of publication?

These come from the Mitchell's + Gazetteer processing, if I surmise correctly from your question — in Mitchell's we have both a place of publication and a list of places (location/s) mentioned in the Mitchell's entry. Those are all linked to gazetteer places.

In the Newspapers field: What are Issue code, Issue date and especially Input sub path?

These come from the alto2txt codebase, where issue code is a unique identifier for the issue, the issue date is the publication date for the issue. Input sub path is a parameter passed through alto2txt to generate the XML files using the XSLT.. Honestly, I'm not really sure why this was kept in the schema, but I inherited it and didn't want to make substantial edits to the schema. I.e. if it's useful data for someone, we might as well keep it in there.