We have some checks in HTML rewriting that fail the scraper if unexpected HTML is encountered.
However, it would be very useful to be able to make these checks failsafe on demand, so that one can scrape a whole property and get a full list of warnings about the issues discovered, rather than discovering them one by one. For instance, this would help to run a full scrape of geo.libretexts.org even though https://github.com/openzim/mindtouch/issues/71 is not yet implemented.
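
To illustrate the idea, here is a minimal sketch of what an on-demand failsafe mode could look like. All names in it (`HtmlCheckReporter`, `UnexpectedHtmlError`, the `failsafe` flag, `check_anchor`) are hypothetical and are not taken from the scraper's code; the missing-`href` check is only a stand-in for the real checks.

```python
from dataclasses import dataclass, field


class UnexpectedHtmlError(Exception):
    """Hypothetical error raised when a check meets unexpected HTML."""


@dataclass
class HtmlCheckReporter:
    # Hypothetical switch: when True, failed checks are recorded as
    # warnings instead of aborting the scrape.
    failsafe: bool = False
    warnings: list[str] = field(default_factory=list)

    def report(self, page: str, message: str) -> None:
        issue = f"{page}: {message}"
        if self.failsafe:
            # Keep going and remember the issue for the final summary.
            self.warnings.append(issue)
        else:
            # Current behaviour: stop at the first issue.
            raise UnexpectedHtmlError(issue)


def check_anchor(reporter: HtmlCheckReporter, page: str, attrs: dict[str, str]) -> None:
    """Stand-in check: an <a> tag is expected to carry an href attribute."""
    if "href" not in attrs:
        reporter.report(page, "<a> tag without href")
```

With `failsafe=True`, a full run would end with `reporter.warnings` holding every issue found, which could then be logged in one go; with the default, the first failed check still stops the scraper as it does today.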