-
The crawling infrastructure is now generic enough and will be of use to Webrecorder as part of the next-generation Browsertrix Core setup, which runs in a single container. The component can move to the Webre…
-
I just zimmed up a WordPress blog with 186 articles (cutoff at 1,000) and about 500 images (https://mesquartierschinois.wordpress.com). Standard, free WordPress, i.e. no funky extensions added.
I woul…
-
## Describe the bug
When indexing a WARC file with records containing `Content-Type: multipart/form-data` (missing "boundary" such as in `multipart/form-data; boundary=----WebKitFormBoundaryrdRXu11…
-
## Wiki Page URL
https://github.com/pirate/ArchiveBox/wiki/Web-Archiving-Community
## Suggested Edit
Greetings,
The page listed above needs some link/info corrections regarding the recent chan…
-
Hi,
Importing gzip-compressed WARC files works very well now. @Orbiter thank you very much for the fix. But in the case of some WARC archives, only the first entry is processed. The log shows the fo…
-
Recently, in the past two weeks or so in QA PyWb, I've noticed many sites that resolve to a non-existent /null
https://www.webarchive.org.uk/act/wayback/archive/20210321105502/http://www.westyorksh…
-
Hello,
I am using a software solution at work that uses OpenWayback for rendering (displaying) .warc files. However, some websites I recently archived cannot be displayed. Some problems appear to …
-
Occasionally, QA PyWb will have difficulty playing back an instance. The error will look like this in QA PyWb:
![image](https://user-images.githubusercontent.com/18530934/97594812-f2be2c80-19fa-11eb-…
-
Hi,
Some content is blocked on the desktop app, and also on the Chrome browsers on the site. On webrecorder.io if you select one of the Firefox browsers there's an in-browser option to "Disable pro…
-
@OfficialEsco @jedieaston @ItzLevvie Can you all please suggest some packages which are shipped through GitHub releases so that I can include them in packages.json?