ulixee / hero

The web browser built for scraping
MIT License
652 stars 32 forks

Collect #155

Closed. calebjclark closed this 1 year ago

calebjclark commented 1 year ago

This PR pulls over many of the collect/extract features of DataboxForHero.

Added support for element.$collect (and nodeList.$collect).

Added support for resource.$collect.

Hero now has a hero.collect() method as well as hero.collectedResources, hero.collectedElements, and hero.collectedSnippets.

Added a new HeroOld class for initializing a prior session to read its collectedElements, etc. It implements the bare minimum needed by DataboxForHero; we can expand it later.
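Until the docs land, here is a rough toy model of the pattern described above: collect a value under a name during one session, then reload that session later and read it back. This is not Hero's implementation or its real signatures. ToySession, ToyElement, collectElement, getElements, serialize, and restore are all made-up names for illustration, and only $collect echoes a name from this PR.

```typescript
// Toy model only: every class and method name below is hypothetical,
// not Hero's real implementation or signatures.

type CollectedStore = { [name: string]: string[] };

class ToySession {
  // Maps a collection name to every value stored under it.
  private elements: CollectedStore = {};

  // Store a value under a name; repeated names accumulate.
  collectElement(name: string, outerHTML: string): void {
    if (!Object.prototype.hasOwnProperty.call(this.elements, name)) {
      this.elements[name] = [];
    }
    this.elements[name].push(outerHTML);
  }

  // Return a copy so callers cannot mutate the store.
  getElements(name: string): string[] {
    const known = Object.prototype.hasOwnProperty.call(this.elements, name);
    return known ? this.elements[name].slice() : [];
  }

  // Persist the store so a later run can reload it.
  serialize(): string {
    return JSON.stringify(this.elements);
  }

  // Rebuild a session from previously serialized data.
  static restore(json: string): ToySession {
    const session = new ToySession();
    const data = JSON.parse(json) as CollectedStore;
    for (const name of Object.keys(data)) {
      for (const value of data[name]) {
        session.collectElement(name, value);
      }
    }
    return session;
  }
}

class ToyElement {
  constructor(private session: ToySession, public outerHTML: string) {}

  // Loosely mirrors the idea of element.$collect: save this element by name.
  $collect(name: string): void {
    this.session.collectElement(name, this.outerHTML);
  }
}

// Usage: collect in a "live" session, then read from a restored copy.
const live = new ToySession();
new ToyElement(live, '<h1>Title</h1>').$collect('title');
const prior = ToySession.restore(live.serialize());
console.log(prior.getElements('title'));
```

The sketch keeps the store as a plain name-to-list map so that the same name can be collected more than once, and so that the restored session needs nothing but the serialized data.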

Documentation coming in a subsequent PR.