internetarchive / dweb-mirror

Offline Internet Archive project
https://www-dweb-mirror.dev.archive.org/
GNU Affero General Public License v3.0
263 stars 27 forks source link

Dependencies on dweb.me or dweb.archive.org #242

Closed mitra42 closed 4 years ago

mitra42 commented 5 years ago

It would be better if dweb-mirror wasn't dependent on dweb.me, that could involve a) calling archive.org directly where appropriate, b) a + performing data manipulation on return rather than in dweb.me c) microservices running on dweb.me that dont depend on the core deb.me server

Next comment will be list of places

(Scroll down to "ROUND 4") but all issues now have own topics so closing

mitra42 commented 5 years ago

Metadata - see comment below DONE

Note this is in parallel to any work done in dweb-transports to split it up and re-modularize it.

mitra42 commented 5 years ago

Archive.py / ArchiveItem DONE

mitra42 commented 5 years ago

Archive.py ArchiveFile < /arc/archive.org/download (which goes via ArchiveItem I think)

mitra42 commented 5 years ago

Archive.js / AdvancedSearch < /arc/archive.org/advancedsearch

mitra42 commented 4 years ago
mitra42 commented 4 years ago

ContentHash

mitra42 commented 4 years ago

/arc/archive.org/metadata and ArchiveItem.metadata

mitra42 commented 4 years ago
mitra42 commented 4 years ago

thumbnaillinks

mitra42 commented 4 years ago

Python/ServerGateway

mitra42 commented 4 years ago

Code rescan

Done: DAC; IAUX DA, DM, dweb-torrent, dweb-metadata for dweb.me dweb.archive.org upstreamPrefix gatewayServer httpUrl

Strategy

Places

mitra42 commented 4 years ago

Torrent file handling

mitra42 commented 4 years ago

metadata/IDENTIFIER/NAME

mitra42 commented 4 years ago

/info

Note DA.SettingsComponent and DA.LocalComponent go to mirror correctly

mitra42 commented 4 years ago

ROUND 3

All these below moved to ROUND 4 below

mitra42 commented 4 years ago

FAILS

STR

Diagnosis

This appears to be fixed, not sure how but prob in fixing other routing issues.

mitra42 commented 4 years ago

Static Files

Files served by dweb.archive.org (inc archive.html, languages, includes etc) see below

Cleanup

mitra42 commented 4 years ago

remove /archive/ forwards from DM.mirrorHttp

mitra42 commented 4 years ago

Consolidate dweb2 to use all new microservices

mitra42 commented 4 years ago

Rip out python code

Python code is now gone .... both www-dweb.dev.archive.org and dweb.archive.org run on Javascript express servers now.

mitra42 commented 4 years ago

dweb2 as dweb - debugging

dweb repo

From kubectl logs -n$NS pod/$POD

mitra42 commented 4 years ago

ROUND 4 - conclusion

Status

Current oustanding issues will get their own issues.