Open Archilegt opened 2 years ago
I finally decided to post a "batch part" for Archiv für Naturgeschichte 64, Band 1
Verhoeff, Carl (1898): Ueber Diplopoden aus Bosnien, Herzogowina und Dalmatien. IV. Theil: Julidae. Archiv für Naturgeschichte, 64:1 (1): 119-160 + pls. V-VI. https://www.biodiversitylibrary.org/part/6891 Remarks:
Verhoeff, Carl (1898): Ueber Diplopoden aus Bosnien, Herzogowina und Dalmatien. V. Theil: Glomeridae und Polyzoniidae (Schluss). Archiv für Naturgeschichte, 64:1 (2): 161-176 + pl. VII. https://www.biodiversitylibrary.org/part/226022 Remarks:
Verhoeff, Carl (1898): Kritisches, systematisch-historisch-litterarisches Verzeichniss der bis Ende 1897 beschriebenen Diplopoden von Oesterreich-Ungarn und dem Occupationsgebiet. Archiv für Naturgeschichte, 64:1 (3): 317-334. https://www.biodiversitylibrary.org/part/226025 Remarks:
Verhoeff, Carl (1898): Beiträge zur Kenntniss paläarktischer Myriopoden. VI. Aufsatz: Ueber paläarktische Geophiliden. Archiv für Naturgeschichte, 64:1 (3): 335-362 + pl. VIII. https://www.biodiversitylibrary.org/part/226026 Remarks:
Verhoeff, Carl (1898): Beiträge zur Kenntniss paläarktischer Myriopoden. VII. Aufsatz: Ueber neue und wenig bekannte Polydesmiden aus Siebenbürgen, Rumänien und dem Banat. Archiv für Naturgeschichte, 64:1 (3): 363-372 + pl. IX. https://www.biodiversitylibrary.org/part/226027 Remarks:
This "batch part" ends here. To be continued with articles from other journals.
Batch part for Zoologischer Anzeiger 21
Verhoeff, Carl (1898): Noch einige Worte über Segmentanhänge bei Insecten und Myriopoden. Zoologischer Anzeiger, 21 (549): 32-39. https://www.biodiversitylibrary.org/page/9739005 Publication date: 10/01/1898 Remarks:
Verhoeff, Carl (1898): Einige Worte über europäische Höhlenfauna. Zoologischer Anzeiger, 21 (552): 136-140. https://www.biodiversitylibrary.org/page/9739109 Publication date: 14/02/1898 Remarks:
Verhoeff, Carl (1898): Bemerkungen zur neuesten „Contribuzione alla conoscenza dei Diplopodi” des Dr. F. Silvestri. Zoologischer Anzeiger, 21 (555): 223-226. https://www.biodiversitylibrary.org/page/9739196 Publication date: 21/03/1898 Remarks:
This "batch part" ends here. To be continued with articles from other journals.
Hi @Archilegt, many thanks for the very useful feedback. Just to be clear, there are IMHO two separate issues here. The first is the assignment of BHL DOIs, the second is the quality of the metadata.
This is particularly remarkable for this article, as during the discussions on retro-DOIs at TDWG 2021, it was assured to me that BHL DOIs were carefully assigned after ensuring that the formed PDF would also include the plates elsewhere in a given volume.
BHL has a rather long and tortuous relationship with DOIs. The goal @nicolekearney and I articulated applies to newly minted DOIs (of the form p.nnnnn
). Some time ago BHL minted a set of DOIs for articles of the form bhl.part.nnnn
. These articles weren't checked for metadata quality, not all articles in those journals where identified, and not all identified articles were assigned DOIs. Neither @nicolekearney or I were involved in that initial batch.
Since 2020 we've been working to identify (as best as possible) all the articles in a journal, have those articles checked by volunteers, consult with existing publishers (if the journal already has DOIs) and CrossRef, then mint DOIs for (ideally) all articles in a journal that BHL has access to.
I would hope that newly minted DOIs (p.nnnnn
) will meet your expectations for what you'd expect from modern publisher (within the constraints that the majority of BHL content is not born digital).
The examples you give of articles lacking plates, or having incomplete metadata are well known problems. Most of the articles in BHL have been found using my semi-automated BioStor tools, which depend on the quality of metadata from various sources. If the source metadata lacks some details, so will BioStor. Given the scale of the task - identifying hundreds of thousands of articles in millions of pages - I really on automation to make some sort of headway.
The issue of missing plates is always frustrating, and typically is only resolved by manual inspection and correction.
For the articles in Archiv für Naturgeschichte 64, Band 1
I will add the missing plates that you've discovered. Regarding how to represent Band and Hefte, there seem to be multiple ways to do this, I note that ZOBODAT has:
Karl Wilhelm [Carl] Verhoeff (1898): Ueber Diplopoden aus Bosnien, Herzegowina und Dalmatien. IV. Theil: Julidae. – Archiv für Naturgeschichte – 64-1: 119 - 160.
I'll leave it as is in BioStor.
@Archilegt Regarding Zoologischer Anzeiger 21
the articles you mention have not been identified in BHL (yet). The challenge is always whether there is good quality metadata available, and finding the time to process that metadata and add it too BioStor (and hence to BHL). In the case of Zoologischer Anzeiger ZOBODAT seems an obvious source https://www.zobodat.at/publikation_series.php?id=20912
This batch part contains a single article.
Verhoeff, Carl (1898): Ueber Diplopoden aus Kleinasien. Verhandlungen der kaiserlich-königlichen zoologisch-botanischen Gesellschaft in Wien, 48: 292-305 + pls. IV-V. https://www.biodiversitylibrary.org/part/39235
Reception date: 25/03/1898 Remarks:
This "batch part" ends here. To be continued with one more article from another journal.
This batch part contains a single article.
Verhoeff, Karl (1898): Fauna diplopoda Bosne, Hercegovine i Dalmacije. Glasnik Zemaljskog muzeja u Bosni i Hercegovini, 10 (2): 467-491. http://www.bosniafacts.info/downloads/elibrary/category/4-glasnik-zemaljskog-muzeja-bosne-i-hercegovine-1889-2009?download=18:glasnik-zemaljskog-muzeja-bosne-i-hercegovine-1898-prvi-dio
Remarks:
This is the last "batch part" for "Verhoeff 1898".
Hi, @rdmpage! Many thanks for your feedback and insights, and for taking care of this issue. I didn't reply directly until now because I really had to focus and push through this curation challenge. It's very late for me now but I wasn't going to bed without thanking you. Have a good night!
Follow up: Regeneration of PDFs with plates successful for: 226022, 226026, 226027. Page interval of 226027 is now correct.
Regeneration of PDFs unsuccessful for: 6891 and 39235. Verhoeff, Carl (1898): Ueber Diplopoden aus Bosnien, Herzogowina und Dalmatien. IV. Theil: Julidae. Archiv für Naturgeschichte, 64:1 (1): 119-160 + pls. V-VI. https://www.biodiversitylibrary.org/part/6891 Remark: This is the only article with DOI. Maybe that is interfering.
Verhoeff, Carl (1898): Ueber Diplopoden aus Kleinasien. Verhandlungen der kaiserlich-königlichen zoologisch-botanischen Gesellschaft in Wien, 48: 292-305 + pls. IV-V. https://www.biodiversitylibrary.org/part/39235 Remark: This was the complex case of adding one more page and one more plate (and maybe related blank pages).
@rdmpage, could you please check this out?
@Archilegt Ah, I think this was my mistake 🤦♂️ I made the changes locally, but didn't push them to https://biostor.org, which means that BHL didn't get them. I've fixed this, so hopefully in a day or two you should be able to get new PDFs.
Many thanks, @rdmpage! I will follow up this issue and close it when I see the changes in BHL.
Follow up: Regeneration of PDF with plates successful for 39235.
Regeneration of PDF unsuccessful for 6891. Verhoeff, Carl (1898): Ueber Diplopoden aus Bosnien, Herzogowina und Dalmatien. IV. Theil: Julidae. Archiv für Naturgeschichte, 64:1 (1): 119-160 + pls. V-VI. https://www.biodiversitylibrary.org/part/6891
The PDF is not formed with plates. Plate V is found at https://www.biodiversitylibrary.org/page/14203474 Plate VI is found at https://www.biodiversitylibrary.org/page/14203476
Mea culpa, I'd updated "Ueber Diplopoden aus Bosnien..." locally but not passed those plates on to BHL. They should have a new PDF in the next day or so.
@mlichtenberg, could you please trigger the update needed above?
Hmmm, there are two issues with part 6891 (that is what is being referred to, correct?).
First, @rdmpage I don't see the item (49922) that segment appears in being sent to BHL in the last couple days. It was last sent on August 24. Second, that segment has a BHL-assigned DOI, so it will not accept updates from BioStor. BHL staff will need to update it by hand. Once that is done, the PDF should regenerate.
@mlichtenberg https://github.com/mlichtenberg The current version in BioStor has the plates http://biostor.org/reference/61689 http://biostor.org/reference/61689 I think because they hadn’t appeared in BHL I assumed I’d failed to update it, whereas I had on August 24th.
So it looks like the issue is the block due to the BHL-assigned DOI. Can I assume that BHL will add these extra plates?
On 31 Aug 2022, at 18:17, mlichtenberg @.***> wrote:
Hmmm, there are two issues with part 6891 (that is what is being referred to, correct?).
First, @rdmpage https://github.com/rdmpage I don't see the item (49922) that segment appears in being sent to BHL in the last couple days. It was last sent on August 24. Second, that segment has a BHL-assigned DOI, so it will not accept updates from BioStor. BHL staff will need to update it by hand. Once that is done, the PDF should regenerate.
— Reply to this email directly, view it on GitHub https://github.com/rdmpage/biostor/issues/97#issuecomment-1233209153, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAAUK2VUXZKBZ6B5YRS5U2TV36HR3ANCNFSM56YHC4WA. You are receiving this because you were mentioned.
@rdmpage I added the plates to the segment, and the PDF has now been regenerated.
Thanks Mike, that’s great!
Wonderful! Many thanks @mlichtenberg and @rdmpage! I uploaded all the PDFs to the respective bibliographic references in Myriatrix.
Some summary statistics: For author=1, year=1, publications=10, one publication is not in BHL, three have no individual references (all from one journal), six have references. From the six with references, five needed curation. That gives a tiny idea of how much still needs to be done for having a complete "bibliography of life" at the author level. I will try expanding this with batch checks for Verhoeff for publication years related to the correspondence that I am processing.
@rdmpage, it may be that you would like to write some of what we did and learn here in the "Verhoeff paper" I am working on with other colleagues. The essence of it is: Martínez-Muñoz CA, Huff D, Meister M, Driller C (2022) Mobilizing and Enhancing Legacy Biodiversity Data: The case of Karl Wilhelm Verhoeff's correspondence. Biodiversity Information Science and Standards 6: e93679. https://doi.org/10.3897/biss.6.93679 but it is more than that and there is definitely space to add something about BioStor. Please, send me an email to my "archilegt" Gmail if you are interested.
Reopening to document bibliographic inconsistency in BHL for Archiv für Naturgeschichte 64, Band 1
Currently: Volume 64, Pages 119--160 https://www.biodiversitylibrary.org/part/6891 Volume 64, Series / Issue Issue: 1, Pages 161--176 https://www.biodiversitylibrary.org/part/226022 Volume 64, Series / Issue Issue: 1, Pages 317--334 https://www.biodiversitylibrary.org/part/226025 Volume 64, Series / Issue Issue: 1, Pages 335--362 https://www.biodiversitylibrary.org/part/226026 Volume 64, Series / Issue Issue: 1, Pages 363--373 https://www.biodiversitylibrary.org/part/226027
Observations: The first reference is better in that while missing the "volume + Band" value "64-1" (as in ZOBODAT) or "64:1" (as in Myriatrix), at least it does not introduce incorrect values as "Issue". In the following four references the issue is incorrect. Also note how the page interval of BHL part 226027 is still incorrect.
The corrected metadata should be: Volume 64-1, Series / Issue Issue: 1, Pages 119--160 https://www.biodiversitylibrary.org/part/6891 Volume 64-1, Series / Issue Issue: 2, Pages 161--176 https://www.biodiversitylibrary.org/part/226022 Volume 64-1, Series / Issue Issue: 3, Pages 317--334 https://www.biodiversitylibrary.org/part/226025 Volume 64-1, Series / Issue Issue: 3, Pages 335--362 https://www.biodiversitylibrary.org/part/226026 Volume 64-1, Series / Issue Issue: 3, Pages 363--372 https://www.biodiversitylibrary.org/part/226027 If value "64-1" is not permitted, then give "64" and correct the issue numbers as above. Correct page interval of BHL part 226027 is 363--372.
@mlichtenberg, is this something that you could manually update? Or what would it require?
Further details for me to investigate: The following PDFs are complete in ZOBODAT, including plates, the source is given as BHL, but there is no indication of whether they were harvested from BHL after the improvements documented here:
https://www.zobodat.at/pdf/Archiv-Naturgeschichte_64-1_0119-0160.pdf https://www.zobodat.at/pdf/Archiv-Naturgeschichte_64-1_0161-0176.pdf https://www.zobodat.at/pdf/Archiv-Naturgeschichte_64-1_0317-0334.pdf [N/A, PDF was always complete] https://www.zobodat.at/pdf/Archiv-Naturgeschichte_64-1_0335-0362.pdf
It is important to document the source and timing of the complete PDFs, to know how fixing something in BioStor improves not just BHL but also other databases like ZOBODAT. It would be important that if/when ZOBODAT harvests new PDFs, the BHL credit page is kept, among other things to allow knowing the PDF generation date. Currently the PDFs in ZOBODAT do credit BHL with a "watermark" on each page but no BHL credit page is present at the end of the PDFs, just the ZOBODAT credit page.
The following ZOBODAT reference has an incorrect page interval and no PDF associated: https://www.zobodat.at/publikation_articles.php?id=231334 Investigate if fixing the page interval at BHL and ZOBODAT would trigger the addition of a PDF from BHL part 226027 to the yet non-existent ZOBODAT URL: https://www.zobodat.at/pdf/Archiv-Naturgeschichte_64-1_0363-0372.pdf
The following PDF is complete in ZOBODAT, including plates, and the source is not BHL but ZOBODAT itself: https://www.zobodat.at/pdf/VZBG_48_0292-0305.pdf Investigate whether triggering an update replaces ZOBODAT PDFs with BHL PDFs.
@Archilegt I've submitted a ticket for the requested metadata updates to the BHL issue tracker.
This page (https://about.biodiversitylibrary.org/ufaqs/ive-noticed-a-problem-with-the-bhl-collection-or-website-what-can-i-do/) has a link to BHL's feedback form, which is where such requests can be submitted.
Many thanks, @mlichtenberg About the contact channel you suggested: The last time I tried using the form at https://www.biodiversitylibrary.org/contact#/comments, it wasn't working and it was not possible to know until trying to submit. I was trying to submit a bibliographic issue, e.g., request titles to scan. I then wrote to @udcmrk but I did not receive a reply. I will try the form again in the future, combined with documentation here in BioStor and other repositories I work on. If it doesn't work, I will try the feedback@biodiversitylibrary.org email in the link you provided. Thanks again!
A batch check on articles authored by Carl [Karl Wilhelm] Verhoeff in year 1898 is in progress. I may add one comment per article as I go, rather than just one long comment for all articles. I will indicate with each comment whether there are no issues or if issues have been detected and action can be taken for each individual article. Completion of this batch check may take a few hours. Please, be patient.