internetarchive / dweb-mirror

Offline Internet Archive project
https://www-dweb-mirror.dev.archive.org/
GNU Affero General Public License v3.0
272 stars 30 forks source link

Crawl UniversalLibrary > universalclassi10leiggoog #209

Closed mitra42 closed 5 years ago

mitra42 commented 5 years ago

Fails in _parse_common of JSON for bookreader which is "" Blocking #202

mitra42 commented 5 years ago

fixed - treat 0 length json in relative files as error (same as file not being there)

mitra42 commented 5 years ago

Similar issue with psychologyunders033255mbp/psychologyunders033255mbp_related.json just caught at different point for some reason checkValidFile fixed to ignore 0 length files