buda-base / buda-iiif-server

the buda Image server based on hymir iiif-server
MIT License
1 stars 2 forks source link

use ebooks when present and relevant #5

Open eroux opened 6 years ago

eroux commented 6 years ago

In Some cases (full volume downloads), it way be relevant to use the ebook if it's present on S3. Not all are generated but when they are it seems they have the following URL convention:

s3://archive.tbrc.org/Works/{md5}/{workid}/eBooks/{workid}-{volumenumber}.pdf

with the volume number padded on 3 digits. Example:

s3://archive.tbrc.org/Works/60/W22084/eBooks/W22084-001.pdf

it's not really any kind of priority, but I thought I'd report this to provide some awareness on the existence of these. The main difference with normal PDF is that they contain bookmarks, table of contents, a cover and a copyright notice. The ones on S3 are quite old so it may be a bad idea to use them, maybe the new ones would make more sense... (but they're not on S3 yet?).

MarcAgate commented 5 years ago

That's an interesting feature. In particular, we could use some "ebook generator" (do we have one ?) and store the new ebooks on S3, on the fly, as they are requested. I believe this applies only to "unicode" works, since it doesn't bring much value to "integrate" images in an ebook. But it would be great, given some library search results to have a corresponding list of available ebooks.