internetarchive / openlibrary

One webpage for every book ever published!
https://openlibrary.org
GNU Affero General Public License v3.0

Incomplete dates on AO items default to XX00 dates #6737

Open DuncanDHall opened 2 years ago

DuncanDHall commented 2 years ago

Old issue raised by Jeff Kaplan in 2017 in Jira: https://webarchive.jira.com/browse/OL-317

When a book's archive.org metadata lists the publication date as "[18--?]", it shows up in Open Library as specifically "1800".

Evidence / Screenshot (if possible)

https://archive.org/details/plaintalesfromth00kipliala
https://archive.org/metadata/plaintalesfromth00kipliala
https://openlibrary.org/books/OL7197646M/Plain_tales_from_the_hills

Proposal & Constraints

Interpret this form of date metadata differently, perhaps as "1800s"?

Stakeholders

@JeffKaplan @mekarpeles @hornc

LeadSongDog commented 2 years ago

Related discussions on edition date issues: #806 #2039 #2711 #3391 #3421 #3496 #3746 #5333 #6021 #6038

TL;DR: the consensus is to convert dates to EDTF (Extended Date/Time Format), store them in that form, and sort on it.
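
To make the proposal concrete, here is a minimal sketch of how a bracketed partial year such as "[18--?]" might be turned into an EDTF-style string, a sort key, and a display label like "1800s". The function names and the regular expression are hypothetical and are not taken from the Open Library codebase. The sketch also assumes that unknown digits map to `X` and that a trailing `?` marks uncertainty; it does not check the result against any particular EDTF conformance level.

```python
import re

# Hypothetical pattern: optional brackets, 2-3 known digits, 1-2 dashes for
# unknown digits, and an optional "?" for uncertainty, e.g. "[18--?]" or "[189-]".
_PARTIAL_YEAR = re.compile(r"^\[?(\d{2,3})(-{1,2})(\?)?\]?$")


def partial_year_to_edtf(raw):
    """Return an EDTF-style string such as '18XX?', or None if raw is not a partial year."""
    match = _PARTIAL_YEAR.match(raw.strip())
    if not match:
        return None
    digits, dashes, uncertain = match.groups()
    # Known digits plus unknown digits must add up to a four-digit year.
    if len(digits) + len(dashes) != 4:
        return None
    return digits + "X" * len(dashes) + ("?" if uncertain else "")


def sort_year(edtf):
    """Earliest year covered by the partial date, usable as a sort key."""
    return int(edtf.rstrip("?").replace("X", "0"))


def display_label(edtf):
    """Human-readable label, e.g. '18XX?' becomes '1800s?'."""
    uncertain = edtf.endswith("?")
    label = edtf.rstrip("?").replace("X", "0") + "s"
    return label + ("?" if uncertain else "")


edtf = partial_year_to_edtf("[18--?]")
print(edtf, sort_year(edtf), display_label(edtf))
```

Keeping the stored value ("18XX?") separate from the sort key and the label means the record never claims the specific year 1800, while lists can still be ordered by the earliest possible year.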