-
I've got a Facebook archiver working by using the `wacz_enricher.py`
https://github.com/djhmateer/auto-archiver/blob/v6-test/src/auto_archiver/enrichers/wacz_enricher.py#L159
Am using a stored p…
-
I have been trying out local deployments of Browsertrix Cloud with microk8s and would find it helpful if I could configure a local storage path to where WACZ/WARCs are written for crawls, so I can put…
-
Hi,
we are try to crawl a site that use s with Javascript based navigation instead of links of tags. The JavaScripts code controlling the behavior of the buttons is hosted on a different domain (CD…
-
I've created wacz file from warc.gz with latest py-warcz package 0.4.5
Original file https://cdn1.ruarxive.org/public/webcollect2022/ngo2022/cafrussia.ru/cafrussia.ru.warc.gz (179MB)
Produced WACZ …
ivbeg updated
2 years ago
-
- [ ] WARC to CAR library
- [ ] WACZ to CAR library (with embedded WARC chunking_)
- [ ] Add code to export WACZ from crawler to CAR
- [ ] Upload CAR to IPFS with auto-js-ipfs
- [ ] Look at splitt…
-
Good morning!
Just an idea. As early adapters of the WACZ format we were thinking that it could be nice in the future to have a specific way we should identify WACZ (before any processing). Request…
-
Trying to archive a youtube video page, but quality is always very low even though the video is available in all kind of resolutions.
Is there a way to force a higher resolution? or is this an issue …
-
Hello, I love this project!!! I was reviewing the verification mechanisms for WACZ files at https://specs.webrecorder.net/wacz-auth/0.1.0/#proof-of-authenticity, and noticed a peculiar wording in thei…
-
Confirm that IPFS wacz files work
-
Just learned about your project. We (Webrecorder) are happy to help if you have any questions / requests.
I wanted to mention that it should be possible to link directly to WACZ files on IPFS so th…