bat-literature / bat-literature.github.io

The Bat Literature Project aims to facilitate discovery of scientific literature on bats (Chiroptera)
Creative Commons Zero v1.0 Universal

(only!) 12 suspicious DOIs found in v0.4 and v0.5 #21

Closed jhpoelen closed 1 month ago

jhpoelen commented 2 months ago

After "clicking" on about 16k DOIs in v0.4, I found only 12 DOIs that did not redirect (a successful redirect returns code 302). Wow! Great work @ajacsherman !

Here's the breakdown of status codes for those DOIs -

$ cat doi-status.tsv | cut -f1 | sort | uniq -c | sort -nr
  15952 302
     11 404
      1 000
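
A status table like the one tallied above could be produced along these lines. This is a minimal sketch, not the script actually used for this report: it assumes curl is installed, and the helper names `doi_status` and `tally_status` and the input file `dois.txt` are hypothetical. Without `--location`, curl reports the resolver's own response (302 for a DOI that redirects), and it prints 000 when no HTTP response arrives at all.

```shell
#!/bin/sh
# Hypothetical helper: print "<status><TAB><url>" for one DOI URL.
# Assumes curl is available. Redirects are not followed, so a working
# DOI shows up as 302; curl writes 000 when it gets no HTTP response.
doi_status() {
  code=$(curl --silent --output /dev/null --max-time 30 \
    --write-out '%{http_code}' "$1") || true
  printf '%s\t%s\n' "$code" "$1"
}

# Tally the first column (status code) of a status table,
# most frequent code first.
tally_status() {
  cut -f1 "$1" | sort | uniq -c | sort -nr
}

# Usage (dois.txt is a hypothetical file with one DOI URL per line):
#   while IFS= read -r url; do doi_status "$url"; done < dois.txt > doi-status.tsv
#   tally_status doi-status.tsv
```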

Here are the non-redirecting DOIs with their associated result codes (404, 000).

It looks like the '000' comes from a DOI that was split in two by stray whitespace or a newline (https://doi.org/10.1371/journal. and pone.0200303): the first half returns 404, and the second half is presumably not a URL at all, so no HTTP status came back.

Also, for the https://doi.org/10.4404/hystrix–00571-2022 related DOIs, a long dash (–) was used instead of the expected hyphen. After replacing the long dash with a hyphen, the DOI redirected OK.

404 https://doi.org/10.1111/j.2008.0030-1299.16212.x
404 https://doi.org/10.3724/SP.J.1141.2010.06633
404 https://doi.org/10.4404/hystrix–00571-2022
404 https://doi.org/10.4404/hystrix–00036-2017
404 https://doi.org/10.4404/hystrix–00237-2019
404 https://doi.org/10.4404/hystrix–00063-2018
404 https://doi.org/10.4404/hystrix–00019-2017
404 https://doi.org/10.4404/hystrix–00503-2021
404 https://doi.org/10.1371/journal.
000 pone.0200303
404 https://doi.org/10.1644/09-MAMM-A-325.1.Key
404 https://doi.org/10.1002/jcla
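
The dash and whitespace problems described above can be caught before any request is made. The following is a rough sketch under stated assumptions: `normalize_doi` and `is_suspicious_doi` are hypothetical helpers, and the checks are heuristics chosen to match the patterns seen in this list, not a full DOI validator. Truncated DOIs that still look well-formed (such as https://doi.org/10.1002/jcla) will not be flagged.

```shell
#!/bin/sh
# Hypothetical helper: replace en dashes (U+2013) and em dashes (U+2014)
# with ASCII hyphens, and trim leading and trailing whitespace.
normalize_doi() {
  printf '%s\n' "$1" | sed -e 's/–/-/g' -e 's/—/-/g' \
    -e 's/^[[:space:]]*//' -e 's/[[:space:]]*$//'
}

# Hypothetical check: return 0 (true) when a DOI URL looks suspicious.
is_suspicious_doi() {
  case "$1" in
    https://doi.org/10.*/*) ;;  # expected shape, keep checking
    *) return 0 ;;              # resolver prefix or DOI prefix is missing
  esac
  case "$1" in
    *.) return 0 ;;             # trailing dot suggests a chopped-off DOI
  esac
  # Any byte outside visible ASCII: a space, a tab, or a UTF-8 encoded
  # character such as a long dash.
  if printf '%s' "$1" | LC_ALL=C grep -q '[^!-~]'; then
    return 0
  fi
  return 1
}
```

For example, `normalize_doi 'https://doi.org/10.4404/hystrix–00571-2022'` prints the same URL with a plain hyphen, which is the form that redirected OK.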

jhpoelen commented 2 months ago

For the v0.5 release, the suspicious DOIs remain similar to those reported for v0.4 -

404 https://doi.org/10.1111/j.2008.0030-1299.16212.x
404 https://doi.org/10.3724/SP.J.1141.2010.06633
404 https://doi.org/10.4404/hystrix–00571-2022
404 https://doi.org/10.4404/hystrix–00036-2017
404 https://doi.org/10.4404/hystrix–00237-2019
404 https://doi.org/10.4404/hystrix–00063-2018
404 https://doi.org/10.4404/hystrix–00019-2017
404 https://doi.org/10.4404/hystrix–00503-2021
404 https://doi.org/10.1371/journal.
000 pone.0200303
404 https://doi.org/10.1644/09-MAMM-A-325.1.Key
404 https://doi.org/10.1002/jcla
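
One way to check how two such reports differ is a line-by-line comparison. This is only a sketch: `compare_reports` is a hypothetical helper, and the file names in the usage line are placeholders, not files from the releases. If both lists are identical, it prints nothing.

```shell
#!/bin/sh
# Hypothetical helper: show lines that appear in only one of two reports.
# Lines only in the first file are printed as-is; lines only in the
# second file are printed with a leading tab (standard comm layout).
compare_reports() {
  cr_a=$(mktemp) && cr_b=$(mktemp) || return 1
  sort -u "$1" > "$cr_a"
  sort -u "$2" > "$cr_b"
  comm -3 "$cr_a" "$cr_b"   # -3 suppresses lines common to both files
  rm -f "$cr_a" "$cr_b"
}

# Usage (placeholder file names):
#   compare_reports suspicious-v0.4.txt suspicious-v0.5.txt
```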

@ajacsherman are you aware of these DOI issues related to the batlit corpus? They seem relatively easy to fix, as they are mostly typos (long dashes -> hyphens).

ajacsherman commented 2 months ago

Morning, My searches for these are not coming up with anything. Can you point me to the records these came from? Best, Aja

On Tue, Aug 20, 2024 at 4:24 PM Jorrit Poelen wrote:

For the v0.5 release, the suspicious DOIs remain similar to those reported for v0.4 -

404 https://doi.org/10.1111/j.2008.0030-1299.16212.x
404 https://doi.org/10.3724/SP.J.1141.2010.06633
404 https://doi.org/10.4404/hystrix–00571-2022
404 https://doi.org/10.4404/hystrix–00036-2017
404 https://doi.org/10.4404/hystrix–00237-2019
404 https://doi.org/10.4404/hystrix–00063-2018
404 https://doi.org/10.4404/hystrix–00019-2017
404 https://doi.org/10.4404/hystrix–00503-2021
404 https://doi.org/10.1371/journal.
000 pone.0200303
404 https://doi.org/10.1644/09-MAMM-A-325.1.Key
404 https://doi.org/10.1002/jcla

@ajacsherman are you aware of these DOI issues related to the batlit corpus? They seem relatively easy to fix, as they are mostly typos (long dashes -> hyphens).

--
Aja Sherman, MS
Bat Eco-Interactions Database Curator
https://batbase.org/
she/her

ajacsherman commented 2 months ago

Good morning,

These DOIs have been edited. Can you run the report and let me know if there are remaining corrupt links? Any other suspicious links, citations, etc.?

https://doi.org/10.1371/journal.pone.0200303 and https://doi.org/10.1002/jcla.20205 did not mention bats, so I deleted these records.

Hope you had a good weekend,

Aja

jhpoelen commented 2 months ago

Hey @ajacsherman,

Thanks for responding to these suspicious DOI reports.

Re-running the DOI report would happen with the next release, if there is one. I realize that any release will have things to improve, so at some point we'll have to call it done. Given the major improvements you've made between v0.4 and v0.5, I wonder whether it is worth the effort (a release takes some time because of the Zotero Web API-based architecture: every record has to be revisited each time to make a versioned snapshot).

Also, I am curious what Ariadna and @n8upham have to say about v0.5. So far they are the ones who have expressed an interest in reviewing v0.5, and they haven't reported any comments yet as far as I can tell.

Curious to hear your thoughts, -jorrit

jhpoelen commented 1 month ago

closing issue in favor of a more recent DOI report in https://github.com/bat-literature/bat-literature.github.io/issues/30