periodo / periodo-data

Tracking PeriodO data quality issues
http://perio.do
The Unlicense
5 stars 0 forks source link

Missing authors / creators / contributors in authority source metadata #59

Open atomrab opened 7 years ago

atomrab commented 7 years ago

This may be related to periodo/periodo-client#90. In at least one instance, a call to a Worldcat record with a clear set of authors, when adding a new collection, pulls in the title and the date but not the creators (http://www.worldcat.org/oclc/892462417). This is a problem if we're trying to make it easy to keep track of intellectual genealogies. I haven't tested to see if the problem is specific to this title, or a current problem with Worldcat titles in general.

ptgolden commented 5 years ago

This is fixed by 93f4d0d08cbe11da8d16b57580f896fdefdf9c91. OCLC seems to use schema:author to mark the author of a work. We were only checking for dc:creator and schema:creator.

I'll go through WorldCat sources and see which ones are missing creators.

ptgolden commented 5 years ago

It must be the case that somewhere along the way, OCLC changed the predicate they used to describe authorship, because a number of WorldCat sources have creators. My guess is 2017, around when you filed this issue.

ptgolden commented 5 years ago

Here are the candidates:

http://www.worldcat.org/oclc/489875840 (in authority p08p8mm) http://www.worldcat.org/oclc/827706625 (in authority p064v7m) http://www.worldcat.org/oclc/37663433 (in authority p05krdx) http://www.worldcat.org/oclc/47825690 (in authority p0cfv7g) http://www.worldcat.org/oclc/489877202 (in authority p0pp7vz) http://www.worldcat.org/oclc/185123526 (in authority p0qp9rs) http://www.worldcat.org/oclc/945765738 (in authority p0pf2qb) http://www.worldcat.org/oclc/489875825 (in authority p0sgtfr) http://www.worldcat.org/oclc/593325476 (in authority p0wnvm4) http://www.worldcat.org/oclc/895860433 (in authority p0zpgf9) http://www.worldcat.org/oclc/894969754 (in authority p0xkgmr) http://www.worldcat.org/oclc/802950569 (in authority p0fk6s4) http://www.worldcat.org/oclc/643777402 (in authority p0dkm29) http://www.worldcat.org/oclc/869568061 (in authority p088hzz) http://www.worldcat.org/oclc/473370569 (in authority p0hsq83) http://www.worldcat.org/oclc/1028624877 (in authority p0jgnvq)

I'll check them out after we deploy the production server

rybesh commented 5 years ago

Moving to periodo-data issues since this has been fixed in the client.

rybesh commented 3 years ago

These authorities still need to be fixed: