-
I'd been running into the issue described in https://github.com/scrapinghub/extruct/issues/215.
Adding `lxml>=5.0.0,
-
This stems from https://github.com/scrapinghub/scrapy-poet/pull/111, where we'd want to implement the API for extracting data from a subset of fields in **web-poet** itself.
# API
The main di…
-
Support for SOCKS5 proxies
http://www.ietf.org/rfc/rfc1928.txt
Maybe we can use the SOCKS5Agent from https://github.com/habnabit/txsocksx
cydu updated 4 months ago
-
It seems that installing extruct via `pip install extruct` now automatically installs lxml 5.2.11, which causes the following import error:
```
ImportError: cannot import name '_ElementStringResult'…
```
svoss updated 4 months ago
-
[Every](https://github.com/scrapy/scrapy/issues/2205) [now](https://github.com/scrapy/scrapy/issues/1858) and [then](https://github.com/scrapy/scrapy/issues/2730) we get a bug report about some HTML s…
-
Ubuntu 22.04, Python 3.10.9, Spidermon 1.17.1
If an item is defined with the `attrs` library and this condition https://github.com/scrapinghub/spidermon/blob/master/spidermon/contrib/scrapy/pipelines.py#L…
-
### Required prerequisites
- [X] I have searched the [Issue Tracker](https://github.com/camel-ai/camel/issues) and [Discussions](https://github.com/camel-ai/camel/discussions) that this hasn't alread…
-
The latest version available from PyPI (0.16.0) breaks when used with an `lxml` version greater than 5.1.0.
The reason is that `lxml` seems to have dropped some internal objects, like `lxml.etree._ElementStri…
-
I installed the library on Python 3.10 and 3.12 on my Linux Mint installation, and I keep getting the following message every time I import the library:
```
from recipe_scrapers import scrape_me…
```
-
On Debian unstable (amd64), Python 3.11.9:
```
$ gourmand
args = Namespace(db_url='', threads=False, gourmanddir='', thread_debug_interval=5.0, thread_debug=False, debug_file='', time=False, debug=N…
```