-
I noticed https://github.com/wabarc/cairn is on the list, but it doesn't support WARC/WACZ. Should that at least be noted in-line?
-
Should start a specific section describing various use cases for WACZ
-
Hi!
When I found out about this project, its name made me think it was a tool to read [WARC files](https://en.wikipedia.org/wiki/Web_ARChive), which stands for... Web ARChives!
Is there support …
-
Some of the collection-based retrieval aspects of this specification are particularly interesting, like the ability to select specific pageIDs of interest.
As you are very well aware, @ikreymer, W…
-
### Context
Pages do not load as they do on the live web.
### What change would you like to see?
A necessary enhancement: pages should load normally, as they do on the web, instead of showing a white page while browsing.
When I go …
-
In the current setup, the crawl runs to completion, and can be scaled up and down (in K8s). If a pod fails, the crawl can be restarted if the volume is available (e.g., a shared NFS). A crawl can also …
-
### ReplayWeb.page Version
2.0.2
### What did you expect to happen? What happened instead?
After archiving the contents of edx.org using the Webrecorder extension in the Brave browser, ReplayWeb.…
-
I'm sorry for a non-descriptive title, but there's nothing more specific I can really say.
I attempted to load an archive from https://archive.org/download/archiveteam_liveleak_20210506071950_2a306039…
-
@giancarlobi this is a placeholder for the formatter need.
Right now it's acting on WARC files. Now that we have automatic WARC to WACZ transformations, we need to adapt code to react to the fact th…
-
I generated some WARC files with [node-warc](https://www.npmjs.com/package/node-warc) and they are mostly fine, except for the bookmarks/sidebar list of pages, which isn't showing the date.
Interesti…