zotero / translators

Zotero Translators
http://www.zotero.org/support/dev/translators
1.3k stars 764 forks source link

COinS translator does not fetch edition from COinS dataset #794

Closed rm2342 closed 10 years ago

rm2342 commented 10 years ago

I have tried to fetch several books from the German National Library (e.g. here https://portal.dnb.de/opac.htm?method=showFullRecord&currentResultId=dreier+and+schulze+and+urhg%26any&currentPosition=0) using Zotero. However, the current edition is not fetched even though it is in the COinS code element.

The German National Library supports Zotero using COinS datasets. For the above book, the Zotero code reads:

                                                                <span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&amp;rft.title=Urheberrechtsgesetz+%3A+Urheberrechtswahrnehmungsgesetz%2C+Kunsturhebergesetz+%3B+Kommentar+%2F+von+Thomas+Dreier+%3B+Gernot+Schulze&amp;rft.pub=Beck&amp;rft.isbn=978-3-406-62747-7&amp;rft.date=2013&amp;rft.genre=book&amp;rft.edition=4.+Aufl.&amp;rft.place=Mu%CC%88nchen"></span>    

Now, although there is an rft.edition element (here with the value "4"), the translator does not fetch it.

aurimasv commented 10 years ago

Thanks for reporting, See the pull request above.

It's not that Zotero doesn't parse the edition, the issue is mostly that when we look up metadata by the ISBN (that's in the COinS), the edition isn't included there.

rm2342 commented 10 years ago

If the German National Library does not supply that information in such cases, maybe we could contact them about this issue?

Am 31.10.14 um 19:38 schrieb Aurimas Vinckevicius:

Thanks for reporting, See the pull request above.

It's not that Zotero doesn't parse the edition, the issue is mostly that when we look up metadata by the ISBN (that's in the COinS), the edition isn't included there.

— Reply to this email directly or view it on GitHub https://github.com/zotero/translators/issues/794#issuecomment-61308447.

GPG-Fingerprint: 2F36 FEF7 DBDC 5C0D 8B0A B3DF DF67 D6B8 1A41 1A1B

aurimasv commented 10 years ago

I think reporting metadata errors or omissions is a good idea and it benefits everyone if the error is fixed (and possibly fixed for multiple items at the same time). I'm actually not sure which ISBN catalog the metadata is missing from, you'd have to take a look. Honestly, it's possible that the ISBN translator didn't scrape correctly (though I doubt it), so I would check that first.

zuphilip commented 10 years ago
(3)(+0000004): HTTP GET http://www.worldcat.org/oclc/846983744?page=endnotealt&client=worldcat.org-detailed_record

(3)(+0001812): Translate: Importing corrected RIS: 

TY  - ELEC
DB  - /z-wcorg/
DP  - http://worldcat.org
ID  - 846983744
LA  - German
T1  - Urheberrechtsgesetz Urheberrechtswahrnehmungsgesetz, Kunsturhebergesetz; Kommentar
A1  - Dreier, Thomas
A1  - Schulze, Gernot
PB  - Beck
CY  - München
Y1  - 2013///
SN  - 9783406627477 3406627471
ER  - 

Thus, the information is coming from worldcat, right? Does this mean that the coins translator tries to identify a ISBN and then just take whatever the ISBN search results? That might be dangerous...

The DNB provides much richer data format then just coins: marc21 (behind paywall?), RDF/XML, BIBFRAME. The BIBFRAME format (project lead by Library of Congress) is especially interesting, because this should become the new data format for libraries (and anyway "MARC must die!";-).

rm2342 commented 10 years ago

This looks like it is the Worldcat database, not the data from DNB. Thus, contacting DNB would not help.

WorldCat has the information about the edition in its database (see here http://www.worldcat.org/title/urheberrechtsgesetz-urheberrechtswahrnehmungsgesetz-kunsturhebergesetz-kommentar/oclc/846983744&referer=brief_results). So maybe there is an error in the RIS-generator at WorldCat?

Am 01.11.14 um 12:04 schrieb zuphilip:

|(3)(+0000004): HTTP GET http://www.worldcat.org/oclc/846983744?page=endnotealt&client=worldcat.org-detailed_record

(3)(+0001812): Translate: Importing corrected RIS:

TY - ELEC DB - /z-wcorg/ DP - http://worldcat.org ID - 846983744 LA - German T1 - Urheberrechtsgesetz Urheberrechtswahrnehmungsgesetz, Kunsturhebergesetz; Kommentar A1 - Dreier, Thomas A1 - Schulze, Gernot PB - Beck CY - München Y1 - 2013/// SN - 9783406627477 3406627471 ER - |

Thus, the information is coming from worldcat, right? Does this mean that the coins translator tries to identify a ISBN and then just take whatever the ISBN search results? That might be dangerous...

The DNB provides much richer data format then just coins: marc21 (behind paywall?), RDF/XML, BIBFRAME. The BIBFRAME format (project lead by Library of Congress) is especially interesting, because this should become the new data format for libraries (and anyway "MARC must die!" http://marc-must-die.info/index.php/Main_Page;-).

— Reply to this email directly or view it on GitHub https://github.com/zotero/translators/issues/794#issuecomment-61364786.

GPG-Fingerprint: 2F36 FEF7 DBDC 5C0D 8B0A B3DF DF67 D6B8 1A41 1A1B

zuphilip commented 10 years ago

I think we can close this one: