Closed Ehsan-U closed 1 month ago
I was thinking about selectolax
selectolax doesn't support XPATH selector nor JMESPath for JSON.
We started out with BeautifulSoup
because of its popularity, but you're right that it has its shortcomings. Adding support for either selectolax or parsel as a new crawler type should be fairly easy - we'll consider it.
+1 for Parsel
@janbuchar, I'd like to help out by adding Parsel support as a new crawler type. Would you be open to a PR from me for this?
@janbuchar, I'd like to help out by adding Parsel support as a new crawler type. Would you be open to a PR from me for this?
Absolutely :slightly_smiling_face:
BeautifulSoup
lacks proper type hints, mostlyAny
type, hence not effective IDE autocompletion. A solid alternative is Parsel. It supports CSS selectors, XPath expressions for HTML and XML, JMESPath for JSON documents, and Regex expressions. Additionally,Parsel
is the parser used byScrapy
.