gitenberg-dev / Second-Folio

A spreadsheet tracking the status of the second GITenberg ebook collection _Second Folio_ with Recovering The Classics covers
Creative Commons Zero v1.0 Universal

incorrect paths for covers in metadata.yaml #18

Open rdhyee opened 9 years ago

rdhyee commented 9 years ago

Consider https://github.com/GITenberg/Adventures-of-Huckleberry-Finn_76/blob/0a41eef0f59c26a6c6527d74fbd315cbb035adb7/metadata.yaml, we have the following paths for covers:

we have

Besides cover.jpg, which is the RTC cover, there might be some correspondence between some of the remaining images and those listed in the Project Gutenberg RDF. But between the drift in Gutenberg metadata and any reorganization of the file structure by GITenberg, doing a match might not be straightforward.

rdhyee commented 9 years ago

What should we do while we don't have a long-term fix in place? I can loop through the Second Folio repos and remove cover metadata entries that don't match the literal paths in the repo.
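A minimal sketch of that pruning step, for one repo. It assumes metadata.yaml has already been parsed into a dict, that the covers sit in a top-level `covers` list, and that each entry stores its path under an `image_path` key; the key names and the function name `prune_missing_covers` are assumptions for illustration, not something confirmed in this thread.

```python
from pathlib import Path


def prune_missing_covers(metadata, repo_dir):
    """Split cover entries into those whose file exists in repo_dir and those that don't.

    `metadata` is the parsed metadata.yaml as a dict. The "covers" and
    "image_path" key names are assumptions about the file layout.
    Returns (cleaned_metadata, dropped_entries); the input dict is not modified.
    """
    repo_dir = Path(repo_dir)
    kept, dropped = [], []
    for cover in metadata.get("covers", []):
        path = cover.get("image_path")
        # Keep the entry only if the literal path points at a real file.
        if path and (repo_dir / path).is_file():
            kept.append(cover)
        else:
            dropped.append(cover)
    cleaned = dict(metadata)  # shallow copy so the caller's dict is untouched
    cleaned["covers"] = kept
    return cleaned, dropped
```

Returning the dropped entries alongside the cleaned metadata keeps a record of what was removed, which would be useful if the paths are later remapped rather than discarded.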

eshellman commented 9 years ago

So, either the initial paths (in rdf metadata) were wrong, or the files have been moved when they were loaded into gitenberg. If the latter, we should be able to fix the paths in yaml.

rdhyee commented 9 years ago

Re https://github.com/gitenberg-dev/Second-Folio/issues/18#issuecomment-128464033

> If the latter, we should be able to fix the paths in yaml.

In the case of Huck Finn, we could do the remapping manually by looking at the list of covers. I don't have a census of all the repos in general. For Second Folio, only a few repos are affected (5), so they could all be fixed by hand if we want.
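If the files were only moved and not renamed, a first pass at the remapping could be suggested automatically and then checked by hand. The sketch below is one possible approach under that assumption: it matches each broken cover path to a file in the repo with the same file name. The function name `suggest_cover_remaps` is hypothetical.

```python
from collections import defaultdict
from pathlib import Path


def suggest_cover_remaps(cover_paths, repo_dir):
    """Suggest a new path for each cover path that does not exist in repo_dir.

    Matching is by file name only, which assumes files were moved but not
    renamed. Returns {old_path: new_path or None}; None means no match or
    more than one candidate, so a person has to decide.
    """
    repo_dir = Path(repo_dir)
    by_name = defaultdict(list)
    for f in repo_dir.rglob("*"):
        rel = f.relative_to(repo_dir)
        # Skip git internals; index every other file by its bare name.
        if f.is_file() and ".git" not in rel.parts:
            by_name[f.name].append(rel.as_posix())

    suggestions = {}
    for path in cover_paths:
        if (repo_dir / path).is_file():
            continue  # path is already correct
        candidates = by_name.get(Path(path).name, [])
        suggestions[path] = candidates[0] if len(candidates) == 1 else None
    return suggestions
```

Ambiguous names are deliberately left as `None` rather than guessed, since the same image file name can appear in more than one directory.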