warc Search Results - Githubissues

1000+ results
for warc

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

iipc/openwayback #22

Support WARC conversion records

**S'sheet line:** 5 **For whom?** BNF, BL, DN **Notes:** CDX/indexing consequences **Est. Milestone:** Ilya to check.

PsypherPunk updated 10 years ago
4
kanishka-linux/reminiscence #13

Support for WARC format

I just wanted to point out that there's a dedicated file format for archiving webpages called Web ARChive (WARC) [1]. It's an open standard used by libraries and afaik can also be uploaded to the wayb…

stonie08 updated 6 years ago
1
openzim/warc2zim #340

Revisit `WARC-Resource-Type` content or add a new header

See https://github.com/webrecorder/browsertrix-crawler/issues/630

benoit74 updated 3 months ago
1
iipc/warc-specifications #96

WARC-Resource-Type field possibilities (feedback wanted)

Browsers have different ways of reporting the 'resource type' for any resource that's being fetched. When using browser-based crawling, it is often easy to access this 'resource type' and store it in …

ikreymer updated 4 months ago
8
webrecorder/warcio #74

Incorrect WARC-Payload-Digest values when transfer encoding …

Per WARC/1.0 spec section 5.9: > The payload of an application/http block is its ‘entity-body’ (per [RFC2616]). The entity-body is the HTTP body *without transfer encoding* per [section 4.3 in R…

JustAnotherArchivist updated 7 months ago
37
webrecorder/replayweb.page #93

Unable to load a big WARC (18GB)

Hi, I'm trying to read a very big WARC, of 18GB as I said in the title, and using the desktop version for Windows, the load stops (in fact, the app only change to a white screen, with the menus and th…

Scorpin updated 5 months ago
5
TechAndCheck/zenodotus #121

Add WARC support where appropriate

WARC is an archive standard that's used by the internet archive and others (including our German friends). The main info on it is here: https://www.loc.gov/preservation/digital/formats/fdd/fdd000236.s…

cguess updated 2 years ago
2
webrecorder/webrecorder-player #86

Player is stuck loading WARC

Using the latest version ![image](https://user-images.githubusercontent.com/29717217/68475614-a00fc180-0239-11ea-98d7-92ac3d4ca0f5.png) ![image](https://user-images.githubusercontent.com/29717217/68…

MadRatSRP updated 5 years ago
1
machawk1/warcreate #107

URIs with invalid characters are not escaped

In some places on the web, invalid URIs may be used to identify resource representations. For example, at one point (perhaps still) Google Fonts recommended values like `https://fonts.googleapis.com/c…

machawk1 updated 6 years ago
1
netarchivesuite/solrwayback #356

Make binary resolving more flexible

The current abstraction of resource (WARC records) resolving expects `[WARC-filename, offset]`. By extending this to `[WARC-filename, offset, timestamp, URL]` it should be possible to use PyWB as back…

tokee updated 1 year ago
2

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for warc

1000+ results
for warc