issues
search
webrecorder
/
cdxj-indexer
CDXJ Indexing of WARC/ARCs
Apache License 2.0
21
stars
10
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix typo in README
#27
machawk1
opened
3 weeks ago
0
Relax constraint in idna < 3
#26
benoit74
opened
1 month ago
0
Upgrade codebase to recent changes in Python ecosystem
#25
benoit74
opened
4 months ago
2
DepreciationWarnings in `pyamf`
#24
benoit74
opened
7 months ago
0
'cgi' is deprecated and slated for removal in Python 3.13
#23
benoit74
opened
7 months ago
1
Revisit records with POST requests lack a POST append in their URL key
#22
ARiedijk
opened
1 year ago
0
Ways of handling problematic WARC records
#21
anjackson
opened
1 year ago
1
SURT are not created for HTTP CONNECT requests in WARC file
#20
ARiedijk
opened
1 year ago
0
collection records + sort optimizations
#19
ikreymer
closed
2 years ago
2
Guard against record's without http
#18
edsu
closed
2 years ago
6
--post-append and memory use
#17
edsu
closed
2 years ago
3
AttributeError: 'NoneType' object has no attribute 'protocol'
#16
edsu
closed
2 years ago
0
Error during indexing: No space left on device
#15
edsu
closed
2 years ago
2
More fixes for JSON parsing
#14
ikreymer
closed
3 years ago
0
post-append json: support top-level list as well as dict, only includ…
#13
ikreymer
closed
3 years ago
0
Support for record and compressed block digest + POST append compatibility with pywb (1.4.0)
#12
ikreymer
closed
3 years ago
0
Extracting page titles / URLs from cdxj
#11
jakebickford
opened
3 years ago
1
by default, skip metadata/resource records that have 'application/war…
#10
ikreymer
closed
3 years ago
0
post request indexing improvements
#9
ikreymer
closed
3 years ago
0
Problem when URL is malformed
#8
PedroG1515
opened
3 years ago
0
Feature Requests / questions on use --> Pipe, Readme
#7
jwest75674
opened
4 years ago
2
Fix repo URL in setup
#6
machawk1
closed
4 years ago
0
Develop->Master for 1.1.0
#5
ikreymer
closed
4 years ago
0
Recompress and Re-indexing Errors
#4
logpanic
opened
4 years ago
0
Mimetype space
#3
nlevitt
closed
4 years ago
0
CDX files generated are not sorted
#2
thomaspreece
opened
6 years ago
3
Has this been pushed to pypi?
#1
machawk1
closed
7 years ago
1