issues
search
openzim
/
zimit
Make a ZIM file from any Web site and surf offline!
GNU General Public License v3.0
335
stars
24
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Automate daily tests of ZIM behavior - Youtube only for now
#351
benoit74
closed
1 month ago
0
Add link to the FAQ in README
#350
kelson42
closed
2 months ago
0
Intelligent crawling by resource types to respect maximum file size
#349
fluidicice
opened
2 months ago
5
Add test checking that expected entries are present
#348
benoit74
closed
1 month ago
0
Fix README and Dockerfile for imprecisions
#347
benoit74
closed
1 month ago
0
Add support for custom behaviors configuration
#346
benoit74
closed
1 month ago
0
Make it clear that --profile argument can be an HTTP(S) URL
#345
benoit74
closed
1 month ago
0
How to perform incremental scrapping of websites ?
#344
nish2482
opened
2 months ago
2
Puppeteer `INTERNAL ERROR`
#343
benoit74
opened
2 months ago
0
Release 2.0.4
#342
benoit74
closed
2 months ago
0
Upgrade to Browsertrix Crawler 1.2.4
#341
benoit74
closed
2 months ago
0
Are search bars standardized code (and thus automatically removable)?
#340
Popolechien
opened
2 months ago
1
Automatically add exclusion rules based on `robots.txt`
#338
benoit74
opened
2 months ago
0
zimit.kiwix.org: all-guitar-chords.com- Parts of zim working
#337
Rexadev
opened
2 months ago
5
zimit.kiwix.it: gtdb.org not working
#336
Rexadev
opened
2 months ago
1
zimit.kiwix.it: jguitar.com not working
#335
Rexadev
closed
2 months ago
1
videos fail to play on Radio zamaneh
#339
Popolechien
opened
2 months ago
1
creating a zim from a website that host images on https://imageshack.com/
#334
kroryan
opened
3 months ago
2
How to scrape large websites in a reasonable manner
#333
benoit74
opened
3 months ago
0
step 4 fails, GPG keys error..
#332
TimSousa1
closed
3 months ago
2
Crawler is not returning full seed page URL in WARC `WARC-Target-URI`
#331
benoit74
closed
2 months ago
1
Implement daily automated testing of Youtube player
#330
benoit74
closed
1 month ago
2
Some websites are raising HTTP2 errors on sisyphus worker
#329
rgaudin
opened
3 months ago
5
Unable to find WARC record for main page
#328
rgaudin
closed
3 months ago
2
CK12 website fails to be crawled properly
#327
benoit74
opened
3 months ago
0
Upgrade to crawler 1.2.0
#326
benoit74
closed
3 months ago
1
Release 2.1.0
#325
benoit74
closed
1 month ago
0
Release 2.0.3
#324
benoit74
closed
3 months ago
0
Youtube videos are not working anymore
#323
benoit74
closed
3 months ago
19
Through zimit I created a zim of forums.gentoo.org but looks like not everything is packaged? Browser extension wants to open internal link as external :(
#321
vitaly-zdanevich
closed
3 months ago
3
Release 2.0.2
#320
benoit74
closed
3 months ago
1
Add support for exploration of `<area>` links and/or custom selectors
#319
benoit74
opened
3 months ago
0
Upgrade dependencies
#318
benoit74
closed
3 months ago
1
after createing a webiste clone ???
#317
spydaz
closed
3 months ago
1
Retrieve automatically the assets present in a `data-xxx` tag
#316
benoit74
closed
2 months ago
3
Lang is not passed to warc2zim
#315
benoit74
closed
3 months ago
1
Fix usage in README.md
#314
benoit74
closed
1 month ago
1
Add support to pass custom behaviors to the crawler
#313
benoit74
closed
1 month ago
0
Release 2.0.1
#312
benoit74
closed
3 months ago
0
Add unit testing
#311
benoit74
opened
3 months ago
0
Strip user-agent whitespaces and ignore empty user agents
#310
benoit74
closed
3 months ago
2
Fix `--waitUntil` crawler options
#309
benoit74
closed
3 months ago
1
Zimit2: "Results" on MDN pages are not shown
#308
benoit74
opened
4 months ago
1
Upgrade to Ubuntu Noble
#307
benoit74
closed
3 weeks ago
1
Many pages on `getbootstrap.com_en_all_2024-05` have erroneously exposed JavaScript (as visible text) appearing on the bottom of the page
#306
Jaifroid
closed
4 months ago
1
Crawler is not correctly checking disk size / usage
#305
benoit74
closed
3 months ago
1
Make a distinction between soft and hard limits
#304
benoit74
opened
4 months ago
2
Merge zimit and warc2zim repositories, Python packages and Docker images
#303
benoit74
opened
4 months ago
3
Add option to directly process WARC files
#301
benoit74
closed
1 month ago
0
wombatSetup.js from warc2zim is not up-to-date in dev image
#300
benoit74
opened
4 months ago
13
Previous
Next