wacz Search Results - Githubissues

399 results for wacz


webrecorder/archiveweb.page #30

Add a smart dedupe feature

If one is recording a site of video content (especially video content which repeats upon, say, a reload or clicking on the link again), the files become huge. Having the ability to intelligently dedup…

deltabravozulu updated 3 years ago, 2 comments

webrecorder/replayweb.page #31

Displaying incorrect url when using a wacz file, works when …

Hi there, first of all thank you very much for this great tool and the idea of the wacz format to simplify the handling of warc files. I'm not sure if this is the correct project to report this to…

cfeddersen updated 3 years ago, 5 comments

frictionlessdata/frictionlessdata.io #588

Using frictionlessdata data package for web archive data pac…

I wanted to reach out to the frictionless data community to share that we (https://github.com/webrecorder, https://webrecorder.net/) are working on a new packaging format to store web archive data, and a…

ikreymer updated 3 years ago, 6 comments

esmero/archipelago-docker-images #21

Alpine Busy Box Patch is broken + add WACZ to esmero-php

# What? How? Well https://www.busybox.net adds in small footprint a lot of basic utilities to a Docker based on Alpine. But guess what? They are "not" exactly the same as the standard GNU. E…

DiegoPino updated 3 years ago, 1 comment

webrecorder/browsertrix-crawler #10

Support Full-Text Extraction

Support text extraction from the DOM, using existing approaches implemented here: https://github.com/webrecorder/archiveweb.page/blob/main/src/recorder.js#L1061 https://github.com/webrecorder/browse…

ikreymer updated 3 years ago, 1 comment

webrecorder/replayweb.page #26

Version 1.2 and some guidance on setup/making the embed plug…

Hi @ikreymer @emmadickson I know you guys are busy with WACZ but wanted to catch up with some issues we have been having on the embed version of replay web on Archipelago with version 1.2 I sus…

DiegoPino updated 3 years ago, 5 comments

programminghistorian/jekyll #2030

Could perma.cc help PH keep weblinks sustainable?

I came across [perma.cc](https://perma.cc/) today and was wondering if it could be useful for the Programming Historian to ensure its weblinks are more 'permanent.' After chatting with @walshbr an…

hawc2 updated 2 years ago, 50 comments

webrecorder/replayweb.page #29

loading stalls on larger WARC files

I am trying to load files that are between 273 MB and 2 GB. I am using a Mac. However, the files stall halfway through when loading. I do not have this issue with small files. These are WARC files…

marklevit updated 3 years ago, 3 comments

webrecorder/replayweb.page #22

"No pages are defined in this archive" + no URL's

Hi, Some days ago I created a WARC file with Heritrix. Webrecorder Players discovers around 10.000 pages; replayweb 0. There certainly are pages and URL's in that WARC-file. Is this a bug? Or maybe…

nvanderperren updated 3 years ago, 4 comments

webrecorder/archiveweb.page #9

Can't download Web Archive

First, thanks a lot for publishing this extension, it makes archiving much more straightforward. I tried to download a Web Archive totalling 2.56 GB as a `wacz` file. The download starts but then g…

yamrzou updated 3 years ago, 9 comments
