scrapy / parsel

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
BSD 3-Clause "New" or "Revised" License
1.15k stars, 146 forks

Parsel vs. Scrapy? Which should I install? #219

Closed (pythonic2020 closed this issue 3 years ago)

pythonic2020 commented 3 years ago

Hello, I am new to web scraping in Python and just installed parsel for extracting and removing data from HTML and XML using XPath and CSS selectors, optionally combined with regular expressions. Parsel seems to have everything I need and works perfectly. I now see that parsel is listed on GitHub under the scrapy organization. Since both packages are available on conda-forge, I am confused about which package I should use (or both?) and what the differences between the two are. Is parsel a subset of scrapy, meaning that scrapy has much more functionality (or is it the other way around)? Should I uninstall parsel and install scrapy instead?

On this page I saw the following, which suggests that parsel is the base library that scrapy builds on:

While these are not changes in Scrapy itself, but rather in the parsel library which Scrapy uses for xpath/css selectors, these changes are worth mentioning here. Scrapy now depends on parsel >= 1.5, and Scrapy documentation is updated to follow recent parsel API conventions.

Thank you in advance. Sorry for the potentially stupid question.

wRAR commented 3 years ago

This is an issue tracker, not a place to ask questions.