China-Digital-Times-CDT / wepreserve

let's preserve the content that matters
https://saveHKonline.org
1 stars 3 forks source link

Direct linking to replayweb.page loading WACZ files #26

Open ikreymer opened 2 years ago

ikreymer commented 2 years ago

Just learned about your project. We (Webrecorder) are happy to help if you have any questions / requests.

I wanted to mention that it should be possible to link directly to WACZ files on IPFS so that they load without having to download the whole file, eg: https://replayweb.page/?source=https%3A%2F%2Fbafybeighmfirp4em5q25z7uselszxz2573ipxchwsdwvdkl7ldnsubwy54.ipfs.dweb.link%2Ffixtures%2Fhkfp-05_03_2022.wacz#view=pages

Currently, this may be slow, and but looking at ways to make this more reliable, so that users can view directly in the browser w/o downloading large files!

ikreymer commented 2 years ago

You can also link to a different gateways, such as: https://replayweb.page/?source=https%3A%2F%2Fipfs.io%2Fipfs%2Fbafybeighmfirp4em5q25z7uselszxz2573ipxchwsdwvdkl7ldnsubwy54%2Ffixtures%2Fhkfp-05_03_2022.wacz#view=info&url=https%3A%2F%2Fhongkongfp.com%2F&ts=20220503152907 which should also work.

gulprun commented 2 years ago

This sounds perfect, thanks for confirming the method we hesitated before. We were struggling with choosing which gateway to link as well because of the ephemeral of many tools/services, however, it's clear we should give end-users a playable link and maintain it at our side rather than leaving them puzzles.

Just learned about your project. We (Webrecorder) are happy to help if you have any questions / requests.

Thanks again, @ikreymer , we do have a thirst to access the cloud version of rowsertrix-crawler if you are in the same team. Currently, we are using the docker version but are pretty much unmanageable for its scope(we are monitoring over 50+ sites with different situations).