internetarchive / dweb-mirror

Offline Internet Archive project
https://www-dweb-mirror.dev.archive.org/
GNU Affero General Public License v3.0
261 stars 27 forks source link

Working with offline content from Jake's tool for Carl #280

Open mitra42 opened 4 years ago

mitra42 commented 4 years ago

See #64 (compatability with Jake's tool)

Carl Malamud has a use case where he has downloaded, using Jake's tool, a bunch of items for storage and availability offline in India.

This item is a Meta task for all the challenges of working with it :-)

See the #carl branch of dweb-mirror for any code below, its not merged because its currently incomplete, mostly untested and adds about 10Mb of dependencies without currently adding functionality.

mitra42 commented 4 years ago

Problem is that Jake's tool downloads files like _meta.xml rather than the JSON.

mitra42 commented 4 years ago
mitra42 commented 4 years ago

Fallback If have internet then can re-download metadata/json then get correctly scaled pages