rdmpage / biostor

Open access articles extracted from the Biodiversity Heritage Library
http://biostor.org

Proceedings of the Entomological Society of Washington replaced #88

Open rdmpage opened 4 years ago

rdmpage commented 4 years ago

Some volumes of Proceedings of the Entomological Society of Washington have been replaced, breaking BioStor articles (and PageIDs in projects like NZ). Volumes affected include http://biostor.org/issn/0013-8797/year/1937 and http://biostor.org/issn/0013-8797/year/1938

crowleyb commented 4 years ago

Hi Rod,

Thanks for the notice. I’m not sure how the process works, but is there a way to rerun it to identify articles in the new volumes?

The reason the volumes were replaced is likely due to some error in the digitization quality, say a page or series of pages was missing.

If we put our heads together I’m sure we can find a process that allows us to curate the BHL collection while also maintaining the assets we support downstream. I’ve included my Tech Team colleagues to chime in with ideas when they can.

Thanks, Bianca


From: Roderic Page notifications@github.com
Sent: Thursday, February 20, 2020 7:29:56 AM
To: rdmpage/biostor biostor@noreply.github.com
Cc: Subscribed subscribed@noreply.github.com
Subject: [rdmpage/biostor] Proceedings of the Entomological Society of Washington replaced (#88)


Some volumes of Proceedings of the Entomological Society of Washington have been replaced, breaking BioStor articles (and PageIDs in projects like NZ). Volumes affected include http://biostor.org/issn/0013-8797/year/1937 and http://biostor.org/issn/0013-8797/year/1938


rdmpage commented 4 years ago

@crowleyb Probably the way forward is to create a mapping between old and new pages (i.e., using their PageIDs), then map the articles to the new identifiers (articles are basically ranges of PageIDs). It would be nice if these updates could also be communicated to BioStor. At present, the articles are simply abandoned, and if there's a redirect for an old PageID it simply goes to the first page in the new BHL Item. This means I have to semi-manually remap the articles to the new content, which is tedious to say the least.
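
A minimal sketch of that remapping idea, not BioStor's or BHL's actual code. It assumes each item is available as an ordered list of (PageID, printed page label) pairs and that pages can be matched on the printed label, which will not hold for every volume. The function names `build_page_map` and `remap_article` and all PageIDs below are made up for illustration.

```python
from typing import Dict, List, Optional, Tuple

# (PageID, printed page label). Hypothetical input shape, not a BHL API type.
Page = Tuple[int, str]


def build_page_map(old_pages: List[Page], new_pages: List[Page]) -> Dict[int, int]:
    """Map old PageIDs to new PageIDs by matching printed page labels."""
    new_by_label: Dict[str, int] = {}
    for page_id, label in new_pages:
        # Keep the first page seen for each label; blank labels cannot be matched.
        if label and label not in new_by_label:
            new_by_label[label] = page_id
    return {
        page_id: new_by_label[label]
        for page_id, label in old_pages
        if label in new_by_label
    }


def remap_article(
    article_pages: List[int],
    page_map: Dict[int, int],
    new_pages: List[Page],
) -> Optional[List[int]]:
    """Treat the article as a page range: map its first and last page, then
    take every page of the new item between them. Returns None if either
    end has no counterpart, so the article can be flagged for manual review."""
    if not article_pages:
        return None
    first = page_map.get(article_pages[0])
    last = page_map.get(article_pages[-1])
    if first is None or last is None:
        return None
    new_ids = [page_id for page_id, _ in new_pages]
    start, end = new_ids.index(first), new_ids.index(last)
    if start > end:
        return None
    return new_ids[start:end + 1]


# Made-up example: the old scan was missing printed page 3, the new scan has it.
old_item = [(1001, "1"), (1002, "2"), (1003, "4")]
new_item = [(2001, "1"), (2002, "2"), (2003, "3"), (2004, "4")]

page_map = build_page_map(old_item, new_item)
new_article = remap_article([1002, 1003], page_map, new_item)
```

Mapping only the two ends of the range means a page that was missing from the old scan is picked up automatically in the remapped article; the cost is that an unmatched first or last page sends the whole article to manual review.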