Open rdhyee opened 9 years ago
What to do while we don't have a longterm fix in place? I can loop through the Second Folio repos and remove cover metadata that don't match the literal paths in the repo.
So, either the initial paths (in rdf metadata) were wrong, or the files have been moved when they were loaded into gitenberg. If the latter, we should be able to fix the paths in yaml.
Re https://github.com/gitenberg-dev/Second-Folio/issues/18#issuecomment-128464033
If the latter, we should be able to fix the paths in yaml.
In the case of Huck Finn, by looking at the list of covers, we could manually do the remapping. Don't have any census of all the repos in general. For Second Folio, there aren't that many repos affected (5) -- so they could all be fixed by hand if we want.
Consider https://github.com/GITenberg/Adventures-of-Huckleberry-Finn_76/blob/0a41eef0f59c26a6c6527d74fbd315cbb035adb7/metadata.yaml, we have the following paths for covers:
we have
Besides cover.jpg, which is the RTC cover, there might be some correspondence betwen some of the rest of the images and those listed in the Project Gutenberg RDF. But between the drift in Gutenberg metadata and any reorganization of file structure from GITenberg, doing a match might not be straightforward.