inhumantsar / slurp

Slurps webpages and saves them as clean, uncluttered Markdown. Think Pocket, but better.
https://inhumantsar.github.io/slurp/
MIT License
127 stars 2 forks source link

Investigate alternatives to Obsidian+Turndown #36

Open inhumantsar opened 1 month ago

inhumantsar commented 1 month ago

Obsidian's implementation/wrapper of Turndown results in some pretty wonky Markdown at times.

It also doesn't take advantage of Obsidian's LaTeX capabilities, eg <img src="https://wikimedia.org/api/rest_v1/media/math/render/svg/f1bbad17b44d494864f74e02217ced4562645be6" aria-hidden="true" alt="{\displaystyle \mathrm {VO_{2}\;max} ={d_{12}-504.9 \over 44.73}}"> imports the image rather than the alt text.

It will be worth spending some time researching the pros/cons of alternative parsers or using Turndown directly and contributing patches upstream.

See also #34