openzim/warc2zim
Command line tool to convert a file in the WARC format to a file in the ZIM format
https://pypi.org/project/warc2zim/
GNU General Public License v3.0
40 stars
5 forks
issues
Refactor HTML rewriter class to make it more open to change and expressive
#343
benoit74
opened
23 hours ago
1
Fuzzy-rule for cheatography.com JS
#342
benoit74
opened
1 day ago
0
Make fuzzy-rule configurable with an external data source
#341
benoit74
opened
1 day ago
0
Revisit `WARC-Resource-Type` content or add a new header
#340
benoit74
opened
2 days ago
0
Fix logic of rewrite mode computation for cases raised in #326
#339
benoit74
closed
2 days ago
2
Exit with cleaner message when no entries are expected in the ZIM
#338
benoit74
closed
2 days ago
0
Exit with cleaner message when main entry is not processable
#337
benoit74
closed
2 days ago
0
Exit with cleaner message when no entries are expected in the ZIM
#336
benoit74
closed
2 days ago
0
Wabac fuzzy rules - update + process
#335
benoit74
closed
2 days ago
1
Fix major issue of MDN ZIM + small fixes
#334
benoit74
opened
3 days ago
0
Youtube player is not working when wombat is configured to not run inside a Service Worker
#333
benoit74
opened
3 days ago
0
ValueError: Incorrect HttpUrl scheme in value
#332
rgaudin
opened
4 days ago
1
LookupError: unknown encoding: unicode
#331
rgaudin
opened
4 days ago
4
der-postillon.com recipe probably needs fuzzy rule(s)
#330
benoit74
opened
4 days ago
0
Add support for MediaSource requests
#329
benoit74
closed
4 days ago
2
Remove the DS rewriting rules
#328
benoit74
opened
5 days ago
3
"Results" sections of developer.mozilla.org (MDN) are not showing up
#327
benoit74
opened
1 week ago
0
Rewrite mode of some resources is not correctly identified
#326
benoit74
closed
2 days ago
4
Fuzzy rule probably needed for hackteria.org
#324
benoit74
opened
1 week ago
0
fonts.google_en is failing to produce working ZIM with Zimit2
#323
benoit74
opened
1 week ago
0
Log is full of messages about "expected" missing entries which match the include/exclude/scopeType
#322
benoit74
opened
1 week ago
0
Add option to specify content header length
#321
benoit74
closed
1 week ago
2
Add option to specify how many characters to consider when searching for the charset in the content header
#320
benoit74
closed
1 week ago
0
Add option to ignore charsets found automatically
#319
benoit74
closed
1 week ago
4
Add options to ignore charsets found automatically
#318
benoit74
closed
1 week ago
0
Release 2.0.2
#317
benoit74
closed
1 week ago
0
Double slash in query string (and path) are not working properly
#316
benoit74
opened
1 week ago
1
Use mimetype to selectively rewrite only html documents
#315
benoit74
closed
2 weeks ago
1
Fix detection of encoding again
#314
benoit74
closed
1 week ago
8
Rewriting logic is trying to rewrite PDFs as HTML document
#313
benoit74
closed
2 weeks ago
0
Automated encoding detection is still not working properly
#312
benoit74
closed
1 week ago
1
Release 2.1.0
#311
benoit74
opened
2 weeks ago
0
Upgrade deps + fix rewrite mode
#310
benoit74
closed
2 weeks ago
2
Cleanup: remove warning log about differing rewrite mode
#309
benoit74
opened
2 weeks ago
1
Fix support of non-GET (POST, PUT, ...) requests rewriting
#308
benoit74
opened
2 weeks ago
3
Fix support for JSONP rewriting
#307
benoit74
opened
2 weeks ago
0
Detect content type based on WARC-Resource-Type
#306
benoit74
closed
2 weeks ago
2
Enhance maintainability and accuracy of HTML rewriting rules
#305
benoit74
opened
2 weeks ago
0
Add support for multiple ZIM langs + validate Language metadata
#304
benoit74
closed
2 weeks ago
2
Set correct charset in HTML documents
#303
benoit74
closed
2 weeks ago
1
Use same automatic encoding detection for all contents
#302
benoit74
closed
2 weeks ago
1
Automatic detection of encoding not used for JS, JSON (and CSS) files
#301
benoit74
closed
2 weeks ago
1
Add support for multiple languages in `--lang` / ZIM metadata
#300
benoit74
closed
2 weeks ago
7
Drop integrity attribute in HTML `<script>` and `<link>` tags
#299
benoit74
closed
3 weeks ago
4
Drop `integrity` attribute in `<script>` HTML tags
#298
benoit74
closed
3 weeks ago
4
Release 2.0.1
#297
benoit74
closed
2 weeks ago
0
Use Warc-Resource-Type header to decide how to rewrite a WARC record
#296
benoit74
closed
2 weeks ago
4
Corrupted unsorted chunks error at the end of ZIM creation
#295
benoit74
opened
3 weeks ago
1
Do not indicate to wombat that we are running inside a SW since we are not
#294
benoit74
closed
3 days ago
4
Zimit2: HTML demos on developer.mozilla.org (MDN) pages are not working
#293
benoit74
opened
4 weeks ago
5