internetarchive/warc
Python library for reading and writing warc files
GNU General Public License v2.0
237 stars
114 forks
issues
Newest
BEWARE
#38
dinoDayo
closed
4 months ago
0
KeyError: 'warc-target-uri'
#36
catharsis71
opened
3 years ago
0
Entirely read gzipped WARC files
#35
sebastian-nagel
closed
1 year ago
0
Unsupported WARC version: 1.1
#34
kiska3
opened
3 years ago
2
Update warc.py
#33
Marcos9988
opened
4 years ago
0
ModuleNotFoundError: No module named '__builtin__'
#32
Chertoganov
opened
5 years ago
3
Gzipped ARCs are now supported.
#31
jordan-dlh
opened
6 years ago
0
How to extract a record based on offset?
#30
kartheek7895
opened
6 years ago
0
WARC file is not compressed record by record
#29
kartheek7895
opened
6 years ago
0
fast seek() for multiprocessing
#28
meshiguge
opened
6 years ago
1
How to pass the WARC-Target-URI to a variable? e.g. f = record.header['WARC-Target-URI']
#27
Gautamshahi
opened
7 years ago
3
Changes to make library compatible with Python3.
#26
baali
opened
8 years ago
7
Python3 Compat
#25
jbrockmendel
opened
8 years ago
1
fix for python requests module >= 1.0.0 for from_response
#24
buckmaxwell
opened
8 years ago
0
Fix WARC writing bug
#23
jeffcasavant
opened
8 years ago
2
let's see what's new in this fork of the warc lib
#22
nlevitt
opened
8 years ago
0
.gz WARC files not properly read
#21
MrMagoffin
opened
9 years ago
11
Apparent issue using wget-created .warc.gz files
#20
ZoeB
opened
9 years ago
1
KeyError warc-target-uri
#19
vschiavoni
opened
10 years ago
3
Bugfix, sets license in the package classifiers to GPLv2
#18
IsaacHaze
opened
10 years ago
0
Update arc.py
#17
jayGattusoNLNZ
opened
11 years ago
0
Allow Digit Characters in Header Names
#16
gthole
opened
11 years ago
0
Basic warc-to-zip script and utility
#15
anjackson
closed
11 years ago
1
Fixing compatibility with Requests>=1.0.0
#14
ersi
opened
11 years ago
0
WARC: from_response incompatible with Requests>=1.0.0
#13
ersi
opened
11 years ago
0
Arc: Reading file metadata (file header appendix)
#12
mishak87
opened
12 years ago
0
Arc: Added support for gzipped archive
#11
mishak87
opened
12 years ago
0
incomplete manual
#10
fanchyna
opened
12 years ago
3
Fix arc parser to work with Alexa ARC records
#9
nibrahim
closed
12 years ago
0
Quick creation of record with default values for headers
#8
nibrahim
opened
12 years ago
1
Create ARCRecord with version so that write_to can work without it
#7
nibrahim
closed
12 years ago
0
Implement browse functionality for arc files
#6
nibrahim
opened
12 years ago
0
Create ARC records with actual HTTP conversation
#5
nibrahim
closed
12 years ago
1
Arc support
#4
nibrahim
closed
12 years ago
0
Bug in WARCRecord.__repr__()
#3
petri
opened
12 years ago
1
support reading older WARC versions
#2
petri
opened
12 years ago
0
Fixed mistake in installation instructions
#1
nibrahim
closed
12 years ago
0