-
Improvements for 1.0.0 branch of crawler:
- Switch from using py-wacz to [js-wacz](https://github.com/harvard-lil/js-wacz) for WACZ generation
- Pass in indexes from `/tmp-cdx` rather than reinde…
tw4l updated
3 months ago
-
@tnafrancesca Please can you add a bit of info and I'll put this on the 6.7.0 board.
-
Just to formalise a comment from #887,
fmt/1840 is emerging as a leading format for web archiving. Structurally it is a zip file containing a JSON manifest file and other structural elements along …
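As a rough illustration of the structure described above (a zip file holding a JSON manifest), the check might be sketched like this. The manifest name `datapackage.json` and the helper `read_manifest` are assumptions for the example, not something the comment specifies.

```python
import io
import json
import zipfile

# Assumption: the manifest is a top-level entry with this name.
MANIFEST_NAME = "datapackage.json"


def read_manifest(source, manifest_name=MANIFEST_NAME):
    """Return the parsed JSON manifest, or None if `source` is not a
    zip file or does not contain an entry called `manifest_name`.

    `source` may be a path or a seekable binary file-like object.
    """
    if not zipfile.is_zipfile(source):
        return None
    with zipfile.ZipFile(source) as archive:
        if manifest_name not in archive.namelist():
            return None
        with archive.open(manifest_name) as handle:
            return json.load(handle)


def build_example_archive():
    """Build a tiny in-memory zip with a manifest, for demonstration only."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr(MANIFEST_NAME, json.dumps({"resources": []}))
    return buffer
```

A real identification signature would likely look at more than the manifest, so this only shows the container-plus-manifest idea.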
-
### Browsertrix Version
v1.10.2-dc9069d
### What did you expect to happen? What happened instead?
The tv2.dk front page was crawled with the ArchiveWeb.page Chromium extension in Brave on Thursday at 11:16 where …
-
### ReplayWeb.page Version
v2.0.0
### What did you expect to happen? What happened instead?
When trying to replay a WACZ in Firefox, I get this error message:
```
Sorry, this URL could not be l…
-
Every time signatures are verified in the code (proofmode, wacz) the public key needs to first be checked against a list of known good public keys. Otherwise there is no point to verifying the signatu…
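A minimal sketch of the ordering described above, assuming trusted keys are stored as SHA-256 fingerprints. The names `fingerprint`, `verify_trusted`, and the `verify_signature` callable are hypothetical; the actual cryptographic check is passed in rather than implemented here.

```python
import hashlib
from typing import Callable, Set


def fingerprint(public_key: bytes) -> str:
    """Hex SHA-256 digest of the raw public key bytes (an assumed convention)."""
    return hashlib.sha256(public_key).hexdigest()


def verify_trusted(
    public_key: bytes,
    message: bytes,
    signature: bytes,
    trusted_fingerprints: Set[str],
    verify_signature: Callable[[bytes, bytes, bytes], bool],
) -> bool:
    """Accept a signature only if the key is on the allowlist AND the
    signature verifies. The allowlist check runs first, so an unknown
    key is rejected without the verifier ever being called.
    """
    if fingerprint(public_key) not in trusted_fingerprints:
        return False
    # `verify_signature` is a placeholder for whatever crypto library is used.
    return verify_signature(public_key, message, signature)
```

Passing the verifier in as a callable keeps the allowlist logic independent of the signature scheme, so the same wrapper could sit in front of each place where signatures are checked.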
-
Details about how to aggregate multiple WACZ files into a single WACZ need to be added to the specification. This hinges on resources in the `datapackage.json` using a `url` for a WACZ rather than a `…
-
### ReplayWeb.page Version
2.0.2
### What did you expect to happen? What happened instead?
After archiving the contents of edx.org using the Webrecorder extension in the Brave browser, ReplayWeb.…
-
I created an archive of https://www.instagram.com/berniesanders/ while logged into Instagram, and not using autopilot. The page seemed to archive fine, but when I went to replay it, it crashed Chrome!
Y…
-
### ReplayWeb.page Version
v1.8.15
### What did you expect to happen? What happened instead?
I archived a Facebook link (https://www.facebook.com/NBCNews) using the Webrecorder ArchiveWeb.page exte…