webrecorder/warcio
Streaming WARC/ARC library for fast web archive IO
https://pypi.python.org/pypi/warcio
Apache License 2.0
387
stars
58
forks
issues
Handle deprecation of naive datetime functions like utcnow()
#185
tw4l
closed
2 weeks ago
4
feat: try py 3.13, plus typos
#184
wumpus
closed
3 weeks ago
0
Stream Recompressor
#183
white-gecko
opened
2 months ago
2
Add docs and https://warcio.readthedocs.io
#182
Florents-Tselai
opened
2 months ago
0
py3.12 and setuptools
#181
wumpus
opened
2 months ago
6
feat: test old ubuntu version
#180
wumpus
closed
3 months ago
0
doc: document how to use brotli; test brotli
#179
wumpus
closed
3 months ago
0
feat: add darwin and windows CI
#178
wumpus
closed
3 months ago
1
feat: try darwin and windows [skip actions]
#177
wumpus
closed
3 months ago
0
chore: finish py3.12
#176
wumpus
closed
3 months ago
0
Test python 3.12
#175
white-gecko
closed
3 months ago
10
Remove superfluous ci step
#174
white-gecko
closed
3 months ago
1
Add very simple test for version argument and use importlib feature instead of deprecated pkg_resources for version
#173
white-gecko
closed
3 months ago
5
Run pytest directly. setup.py test was removed in setuptools 72.
#172
white-gecko
closed
3 months ago
5
Update codecov/codecov-action from v1 to v4
#171
white-gecko
closed
3 months ago
0
Adjust classifiers to the actually tested build matrix
#170
white-gecko
closed
3 months ago
1
Migrate from setup.py to poetry/pyproject.toml
#169
white-gecko
opened
3 months ago
15
Add dependency for setuptools, which is required by cli get_version command
#168
white-gecko
closed
3 months ago
5
Bump urllib3 from 1.25.11 to 1.26.19
#167
dependabot[bot]
closed
3 months ago
1
Bump urllib3 from 1.25.11 to 1.26.18
#166
dependabot[bot]
closed
6 months ago
2
Add test to HTTPS proxies
#165
tw4l
opened
6 months ago
0
Migrate to GitHub Actions CI and resolve dependency issues
#164
tw4l
closed
6 months ago
1
DeprecationWarning: datetime.datetime.utcnow() is deprecated and scheduled for removal in a future version
#163
benoit74
closed
2 weeks ago
8
warcio recompress adds "WARC-Payload-Digest" to records without understanding them
#162
acidus99
opened
10 months ago
0
warcio recompress adds WARC-Block-Digest fields to records without one
#161
acidus99
opened
10 months ago
0
Fix typos discovered by codespell
#160
cclauss
closed
8 months ago
0
Delete .travis.yml because Travis CI is no longer free
#159
cclauss
closed
6 months ago
1
"warcio check" does not warn of illegal characters in field names or values, including LF
#158
acidus99
closed
1 year ago
8
warcio accepts a bare LF everywhere a CRLF is required by the spec
#157
acidus99
closed
1 year ago
1
"warcio check" incorrectly reporting payload digest failures for non-HTTP WARCs
#156
acidus99
opened
1 year ago
2
doc bugs linking to source code files
#155
wumpus
opened
1 year ago
0
Deimos/add https type
#154
Deimos4Flare
closed
1 year ago
0
Add support for the 1995 NCSA 1.5.1 webserver
#153
omgoo
opened
1 year ago
4
wget warc status code?
#152
johnmaguire
closed
1 year ago
3
webrecorder fails to open IA warc file on MacOS X Ventura 13.2.1
#151
theopathic
closed
1 year ago
2
warcio cannot write wet files
#150
mraslann
opened
2 years ago
0
Patching WARCs using warcio
#149
wsdookadr
opened
2 years ago
0
GitHub Action to lint Python code
#148
cclauss
closed
10 months ago
0
Trying to write to closed file when using `requests.Session`
#147
maxyousif15
opened
2 years ago
0
Empty WARC files when deploying warcio on Airflow
#146
maxyousif15
closed
2 years ago
5
fix utf-8 encoding
#145
tomeksporczyk
opened
2 years ago
2
warcio.exceptions.ArchiveLoadFailed: Unknown archive format
#144
KyloPrem
closed
2 years ago
3
Documentation: Clarify that capture_http writer with filename has no get_stream method
#143
voltagex
opened
2 years ago
3
Issues with encoding of http-answers
#142
Weyaaron
closed
2 years ago
2
Warcio does not support replay of sites hosted on NCSA 1.5
#141
omgoo
opened
2 years ago
3
Record not followed by newline (conversion error)
#140
mw0000
opened
2 years ago
1
`capture_http` fails in tests, but works otherwise
#139
maxyousif15
closed
3 years ago
5
warcio check does not raise error when GZip records are truncated
#138
anjackson
opened
3 years ago
5
extract entire warc file?
#137
catharsis71
closed
2 years ago
4
CLI Indexer: silently ignore brokenpipe signal
#136
sebastian-nagel
opened
3 years ago
5