simonw / shot-scraper

A command-line utility for taking automated screenshots of websites
https://shot-scraper.datasette.io
Apache License 2.0
1.57k stars, 70 forks

Auto scraper? #132

Open walking-octopus opened 6 months ago

walking-octopus commented 6 months ago

There's a neat little package, autoscraper, that lets you quickly build no-code web extractors.

I wonder whether the idea could be extended to also use data from the heap to try to get the text out, especially given that it's a lot messier than hunting for a selector.

This could be prototyped as another CLI on top of the heap, HTML, and image exporting here.
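
To make the "learn from an example" idea concrete, here is a rough standard-library sketch. It is not how autoscraper is implemented and it uses no shot-scraper API; the names `learn_paths` and `extract` and the sample HTML are made up for illustration. It assumes the HTML has already been exported to a string, records the tag path of each example value, and then returns every text node that sits at the same path.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class PathTextParser(HTMLParser):
    """Collect (tag_path, text) pairs for every non-empty text node."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open elements, e.g. ["ul.items", "li"]
        self.found = []   # list of (path tuple, stripped text)

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        cls = dict(attrs).get("class") or ""
        self.stack.append(f"{tag}.{cls}" if cls else tag)

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, tolerating sloppy HTML.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i].split(".", 1)[0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.found.append((tuple(self.stack), text))


def text_nodes(html):
    parser = PathTextParser()
    parser.feed(html)
    parser.close()
    return parser.found


def learn_paths(html, wanted):
    """Hypothetical helper: return the tag paths where the example values occur."""
    wanted = set(wanted)
    return {path for path, text in text_nodes(html) if text in wanted}


def extract(html, paths):
    """Hypothetical helper: return all text found at the learned tag paths."""
    return [text for path, text in text_nodes(html) if path in paths]


# Made-up sample page, for illustration only.
SAMPLE = """
<ul class="items">
  <li><span class="title">Alpha</span><span class="price">10</span></li>
  <li><span class="title">Beta</span><span class="price">20</span></li>
</ul>
"""

if __name__ == "__main__":
    rules = learn_paths(SAMPLE, ["Alpha"])
    print(extract(SAMPLE, rules))  # ['Alpha', 'Beta']
```

Giving one example value ("Alpha") is enough to pull the sibling value ("Beta") because both share the same tag path. A heap-based variant would need a different notion of "path", which this sketch does not attempt.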