Closed Rexadev closed 4 months ago
Zimit uses Browsertrix Crawler, which is a static crawler: it captures everything that is fetched to load a given page, then looks at the links available on that page and crawls them, and so on until there are no more links to explore.
The result works well only if all links are discoverable from static HTML.
Here you are using a feature which generates a web request dynamically. That request was never made during the crawl, so its response is not present in the WARC/ZIM, and the feature cannot work in the ZIM.
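
To illustrate why such a request is missed, here is a minimal sketch of link-following discovery. It is not Browsertrix code and it simplifies a lot: the real crawler drives a browser and also records whatever the page fetches while loading. The `SITE` dictionary, its URLs, and the `crawl` helper are hypothetical and exist only for this example.

```python
from collections import deque
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects href values of <a> tags found in static HTML."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


# Hypothetical in-memory site: the chord data is only requested by
# JavaScript when the user clicks a button, never through an <a> link.
SITE = {
    "/": '<a href="/chords">Chords</a>',
    "/chords": '<button onclick="fetch(\'/api/chord?name=Am\')">Am</button>',
    "/api/chord?name=Am": '{"frets": [0, 1, 2, 2, 0, -1]}',
}


def crawl(start):
    """Breadth-first crawl that only follows links found in static HTML."""
    seen = {start}
    queue = deque([start])
    while queue:
        url = queue.popleft()
        parser = LinkExtractor()
        parser.feed(SITE.get(url, ""))
        for link in parser.links:
            if link in SITE and link not in seen:
                seen.add(link)
                queue.append(link)
    return seen


captured = crawl("/")
print(sorted(captured))                   # pages reached through links
print("/api/chord?name=Am" in captured)   # the dynamic request is absent
```

In this sketch the chord endpoint exists on the "site" but is never captured, because nothing in the static HTML links to it and nobody clicks the button during the crawl.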
With the right expertise, you could develop a custom Browsertrix behavior which might solve this issue by exploring the various chords for you. This is not an easy feat (you need to be a JS programmer), and there is no guarantee it will be sufficient.
Closing the issue since this is a known limitation, and the software will probably never be adapted to solve such cases automatically.
https://s3.us-west-1.wasabisys.com/org-kiwix-zimit/other/jguitar.com_d4664352.zim
https://github.com/openzim/zimit/assets/62152714/917eacc4-8561-445e-872b-fb7714c48a40
I wrote this but I didn't use