-
Have not yet found a way to consistently do this via JavaScript. Same data from Htrix WARCs return hex-like values from UNIX shasum but Htrix hashes have characters beyond this scope (e.g., "M"). The …
-
Could it be possible for memex to store a [WARC file](https://en.wikipedia.org/wiki/Web_ARChive) of a bookmarked site? It would be very useful to keep a permanent record of bookmarks so that it's poss…
-
I built a thing that tests a warc for standards conformance. The cli is similar to "warcio check". It's 440 lines of code so far, likely to be around 1,000 when done.
It will need an extended testi…
-
Hi,
I'm looking for a way to integrate with a library of low poly models offline (for privacy concerns and to give reliable workshops) to facilitate in VR content creation. For now I have manually …
-
If for some resources the crawler encounters a ZIM file on a web property, we should immediately block it so that it is not included inside the WARC and then inside the ZIM.
This is probably a page…
-
340 WARC files of the news crawl data set, starting from 2020-09-12 until 2020-10-04 have been captured using [HTTP/2](https://en.wikipedia.org/wiki/HTTP/2) after a [Java security upgrade](https://mai…
-
archived photos from old webshots app found and unzipped. now on .warc file - how do I open this?
I tried to copy the file but got your message ''We don't support that file type//
-
I would prefer to have a settings menu where I can specify the location that the _warc_cache folder is stored, i.e. `~`, `/Volumes/Website\ Archives`, `G:\Data\Websites`.
Additionally, I would like…
-
Compare and contrast the resulting WARC files on the `https://odu.edu/compsci` URI generated by any two of the following tools:
* [Wget](https://www.gnu.org/software/wget/manual/wget.html#index-WARC)…
-
### Browsertrix Cloud Version
v1.9.3-79a217b
### What did you expect to happen? What happened instead?
I have found some new WARC fields and files in the newest WACZ from beta.browsertrix release: …