scrapy / scrapely

A pure-python HTML screen-scraping library
1.86k stars 315 forks source link

Provide method for parsing HTML that has already been downloaded by external libraries. #43

Closed louist87 closed 10 years ago

louist87 commented 10 years ago

While this library seems very appealing, the fact that Scraper.scrape makes blocking IO calls is a problem for those of us who would like to use it with an asynchronous framework such as Twisted.

It would be nice to have a Scraper function that takes a string of HTML and parses it, thus allowing the user to avoid blocking calls.

louist87 commented 10 years ago

Looks like I was using an old version on pypi. Apologies!