openzim/zimit
Make a ZIM file from any Web site and surf offline!
GNU General Public License v3.0
262 stars
22 forks
issues
step 4 fails, GPG keys error..
#332
TimSousa1
closed
23 hours ago
2
Crawler is not returning full seed page URL in WARC `WARC-Target-URI`
#331
benoit74
opened
3 days ago
0
Implement daily automated testing of Youtube player
#330
benoit74
opened
4 days ago
2
Is HTTP2 an issue?
#329
rgaudin
opened
4 days ago
1
Unable to find WARC record for main page
#328
rgaudin
closed
4 days ago
2
CK12 website fails to be crawled properly
#327
benoit74
opened
5 days ago
0
Upgrade to crawler 1.2.0
#326
benoit74
closed
5 days ago
1
Release 2.1.0
#325
benoit74
opened
5 days ago
0
Release 2.0.3
#324
benoit74
closed
5 days ago
0
Youtube videos are not working anymore
#323
benoit74
closed
5 days ago
19
Through zimit I created a zim of forums.gentoo.org but looks like not everything is packaged? Browser extension wants to open internal link as external :(
#321
vitaly-zdanevich
closed
6 days ago
3
Release 2.0.2
#320
benoit74
closed
1 week ago
1
Add support for exploration of `<area>` links and/or custom selectors
#319
benoit74
opened
1 week ago
0
Upgrade dependencies
#318
benoit74
closed
2 weeks ago
1
after creating a website clone ???
#317
spydaz
closed
2 weeks ago
1
Retrieve automatically the assets present in a `data-xxx` tag
#316
benoit74
opened
3 weeks ago
2
Lang is not passed to warc2zim
#315
benoit74
closed
2 weeks ago
1
Fix usage in README.md
#314
benoit74
opened
3 weeks ago
1
Add support to pass custom behaviors to the crawler
#313
benoit74
opened
3 weeks ago
0
Release 2.0.1
#312
benoit74
closed
2 weeks ago
0
Add unit testing
#311
benoit74
opened
3 weeks ago
0
Strip user-agent whitespaces and ignore empty user agents
#310
benoit74
closed
3 weeks ago
2
Fix `--waitUntil` crawler options
#309
benoit74
closed
3 weeks ago
1
Zimit2: "Results" on MDN pages are not shown
#308
benoit74
opened
4 weeks ago
0
Upgrade to Ubuntu Noble
#307
benoit74
opened
1 month ago
1
Many pages on `getbootstrap.com_en_all_2024-05` have erroneously exposed JavaScript (as visible text) appearing on the bottom of the page
#306
Jaifroid
closed
1 month ago
1
Crawler is not correctly checking disk size / usage
#305
benoit74
closed
2 weeks ago
1
Make a distinction between soft and hard limits
#304
benoit74
opened
1 month ago
2
Merge zimit and warc2zim repositories, Python packages and Docker images
#303
benoit74
opened
1 month ago
3
Add option to directly process WARC files
#301
benoit74
opened
1 month ago
0
wombatSetup.js from warc2zim is not up-to-date in dev image
#300
benoit74
opened
1 month ago
13
Invalid WARC Record
#299
rgaudin
opened
1 month ago
1
Add support for only crawling the website, not calling warc2zim
#298
benoit74
closed
1 month ago
1
Add option to only crawl website and not run warc2zim conversion
#297
benoit74
closed
1 month ago
6
Crawler error: Cannot convert argument to a ByteString
#296
benoit74
closed
1 month ago
3
[zimit1] scraper never exits
#295
rgaudin
closed
1 week ago
1
No output after quitting early
#294
Mitsunee
closed
2 months ago
3
--exclude question
#293
onexecute
closed
2 months ago
4
Change crawler default settings around userAgent and mobileDevice
#292
benoit74
closed
3 months ago
1
Zimit2: Youtube videos are not working everywhere
#291
benoit74
closed
2 months ago
8
Browsertrix Crawler is stopping on disk full while it is not full
#290
benoit74
closed
1 month ago
2
networkidle is no longer a valid waitUntil
#289
brandonocasey
closed
3 weeks ago
7
Add support for downloading the browser profile from a URL
#288
benoit74
opened
3 months ago
0
Enhance integration test to assert final content of the ZIM
#287
benoit74
opened
3 months ago
0
Upgrade to Python 3.12, upgrade Python dependencies and add hatch-openzim plugin
#286
benoit74
closed
3 months ago
3
Upgrade browsertrix crawler and remove redirect handling
#285
benoit74
closed
3 months ago
7
Upgrade to browsertrix crawler 1.0.0 beta
#284
benoit74
closed
3 months ago
7
solar.lowtechmagazine.com is very unstable
#283
benoit74
closed
1 week ago
5
URL is different in error message
#282
rgaudin
closed
4 months ago
2
Invalid leading whitespace in User-Agent header
#281
benoit74
closed
3 weeks ago
1