gutenbergtools / ebookconverter

code that orchestrates ebook conversion for project gutenberg
GNU General Public License v3.0
7 stars 2 forks source link

Blank ebook pages on gutenberg.org #43

Closed mrcook closed 10 months ago

mrcook commented 10 months ago

I've encountered 4 missing ebooks from the gutenberg website: 38200, 57983, 64156, 65643. These pages contain no data, and only have a link to the RDF file. I discovered these because the RDF files also have blank data.

In the July 2021 newsletter, 65643 is listed as, "The lives of celebrated travellers, Vol. 2, by James Augustus St. John"

Related, but I guess not an eText release is ID 90907, which exists in the rdf-files.tar.zip offline archive, and also contains blank data.

P.S. I wasn't sure where to raise this issue so please point me to the correct place if needed.

eshellman commented 10 months ago

I keep a list of etext numbers that are "missing" or should be excluded from various processes

https://github.com/gitenberg-dev/gitberg/blob/master/gitenberg/data/missing.tsv

38200,

removed due to copyright issues

57983,

unused 64156,

unused 65643

missing

PG has no workflow for voiding an etext number and it happens for various reasons on an ad hoc basis. This is an appropriate place for the issue; as code here implements a number of related workflows. However, there are a number of informal workflows involved, and it's not clear that it's an urgent problem for anyone.

Eric

On Nov 3, 2023, at 10:16 AM, Michael R. Cook @.***> wrote:

I've encountered 4 missing ebooks from the gutenberg website: 38200, 57983, 64156, 65643. These pages contain no data, and only have a link to the RDF file. I discovered these because the RDF files also have blank data.

In the July 2021 newsletter, 65643 https://www.gutenberg.org/ebooks/65643 is listed as, "The lives of celebrated travellers, Vol. 2, by James Augustus St. John"

Related, but I guess not an eText release is ID 90907, which exists in the rdf-files.tar.zip offline archive, and also contains blank data.

P.S. I wasn't sure where to raise this issue so please point me to the correct place if needed.

— Reply to this email directly, view it on GitHub https://github.com/gutenbergtools/ebookconverter/issues/43, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAHCGMLCHTIHWMNSRLI3CJLYCT4C7AVCNFSM6AAAAAA64QCTWKVHI2DSMVQWIX3LMV43ASLTON2WKOZRHE3TMMZQGE4TSMI. You are receiving this because you are subscribed to this thread.

mrcook commented 10 months ago

Thanks Eric. As they're all known then I'll close the ticket.