inventaire / inventaire

a libre collaborative resource mapper powered by open-knowledge, starting with books! :books:
https://inventaire.io
440 stars 28 forks source link

remove ISBNs status of canonical edition URIs ids #192

Open maxlath opened 6 years ago

maxlath commented 6 years ago

preferring inv URIs over isbn URIs would have several advantages:

Requirements:

maxlath commented 6 years ago

examples of several editions for one ISBN:

belett commented 6 years ago

In fact, there is (at least) two different situations that broke the 1 edition = 1 ISBN :

The first case is weird but the second case is the most trickiest because when someone has the book in them hands, there is no way to know that the ISBN is wrong.

jum-s commented 5 years ago

2 editors and 2 isbns on one edition, what a funny world we live in.. The "wrong" isbn case happens because of database attempts defined before printing. This a priori definition do not fit the physical world. Can we then say the printed isbn actually is the correct one ? Systems that use code bar scanners would. Human work sourcing and/or editors database sourcing.

maxlath commented 5 years ago

maybe we need to introduce Wikidata-like references in Inventaire data model to say that the cover says something and the database something else ;)

belett commented 5 years ago

Can we then say the printed isbn actually is the correct one ?

In theory, we could. In some cases that could work but most of the time, this would be weird and dangerous.

Here some examples:

Finally, it depends on what database we talk about, it could be "defined before printing" but also "corrected after printing". For an official database like the BnF catalog - which is in charge of giving and administrating ISBN in France - it almost always the second case.

maxlath commented 5 years ago

@belett:

it's not uncommon for one edition to have multiple ISBN per format [source]

not sure how we should deal with this kind of case

belett commented 5 years ago

On the wikidata side, multiple ISBN per edition should always have qualifiers. Maybe you can use these qualifiers?

maxlath commented 5 years ago

another issue created by ISBN uris, consider the following scenario:

This change of item entity uri already happens in the case of an entity merge, but not on entity claims update. Removing the status of canonical uri to ISBN would solve this issue: no more need to update URIs on claims updates

maxlath commented 4 years ago

doing some ISBN cleanup in the database as we got some madness in there:

this bad conversions 979 ISBN-13 -> ISBN-10 -> 978 ISBN-13 are good candidates to explain all the reports we got for ISBNs that were already used by a different book, especially between the 978-1-0 and the 979-10 groups. Here after, an examples of entities that I had to manually split in the database, because both had where based on 9781090648471 (but with different hyphenation, due to the lib change, yeay):

maxlath commented 4 years ago

a new illustration of the problem isbn (source)