htrc / htrc-feature-reader

Tools for working with HTRC Feature Extraction files

Page image module #20

Closed: organisciak closed this issue 4 years ago

organisciak commented 7 years ago

The HathiTrust Digital Library (HT DL) page images have a predictable URL. For a public domain book, accessed from an acceptable IP, a feature that produces the image URL for a given page would be valuable for testing. Even better: a Jupyter module for displaying the page image inline.
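A rough sketch of what such a URL helper might look like. The endpoint and the `id`/`seq`/`width` query parameters are assumptions (this issue only says the URL is predictable), and `page_image_url` is a hypothetical name, not an existing function in the library.

```python
from urllib.parse import urlencode

# ASSUMPTION: endpoint and parameter names are not confirmed by this issue.
ASSUMED_IMAGE_ENDPOINT = "https://babel.hathitrust.org/cgi/imgsrv/image"


def page_image_url(htid, seq, width=None, endpoint=ASSUMED_IMAGE_ENDPOINT):
    """Build an image URL for one page of a volume (hypothetical helper).

    htid  -- volume identifier
    seq   -- 1-based page sequence number
    width -- optional requested image width in pixels
    """
    if seq < 1:
        raise ValueError("seq must be a positive page sequence number")
    params = {"id": htid, "seq": seq}
    if width is not None:
        params["width"] = width
    # urlencode escapes characters such as ':' and '/' that can occur in IDs.
    return endpoint + "?" + urlencode(params)


# "mdp.00000000000000" is a placeholder ID, not a real volume.
print(page_image_url("mdp.00000000000000", 5))
```

In a notebook, passing the resulting URL to `IPython.display.Image(url=...)` should be enough to show the page inline, provided the request comes from an acceptable IP and the book is public domain.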

All things being equal, this would save HT bandwidth (versus loading a whole book and scrolling through irrelevant pages), but it also has the potential to support abusive loads. Perhaps an optional but on-by-default setting that rate-limits any requests to HT?
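One way the on-by-default rate limit could be sketched, assuming a simple minimum interval between requests. `RateLimiter`, its parameters, and the one-second default are all hypothetical choices for illustration, not anything decided in this issue.

```python
import time


class RateLimiter:
    """Enforce a minimum interval between calls (hypothetical helper).

    enabled defaults to True, so throttling is on unless the caller
    opts out. clock and sleep are injectable to make testing easy.
    """

    def __init__(self, min_interval=1.0, enabled=True,
                 clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self.enabled = enabled
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous permitted call

    def wait(self):
        """Block until the next call is allowed; return seconds slept."""
        if not self.enabled:
            return 0.0
        delay = 0.0
        if self._last is not None:
            elapsed = self._clock() - self._last
            delay = max(0.0, self.min_interval - elapsed)
        if delay > 0:
            self._sleep(delay)
        self._last = self._clock()
        return delay


limiter = RateLimiter(min_interval=1.0)  # on by default
limiter.wait()  # call this before each image request
```

Calling `wait()` before every image fetch keeps a loop over pages from hammering the server, while `enabled=False` leaves an escape hatch for users who have their own throttling.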