openzim / mindtouch

libretexts.org to ZIM scraper
GNU General Public License v3.0
0 stars 0 forks source link

Should we apply head and tail from page content? #10

Open benoit74 opened 3 weeks ago

benoit74 commented 3 weeks ago

In addition to MathJax #9, we have other "things" in head and tail attributes of page content.

Do we need / want to include these?

rgaudin commented 3 weeks ago

Haven't looked in detail but I think that if those are attributes to some node and that you ask the question then there is no clear usage and thus it should not be included. I understand this scraper is a standard scraper: ie. we build up our ZIM with data we scraped ; in opposition to zimit where we pass data from website to ZIM with some transformation

benoit74 commented 2 weeks ago

Intention so far is indeed to better understand what is in these attributes and if there is a usage / use-case.