-
GECCO can't be installed in a Python 3.10 environment. This is actually caused because [`python-crfsuite` won't build in Python 3.10](https://github.com/scrapinghub/python-crfsuite/issues/130), so I'm…
-
When install `master` branch version and try to run a simple spider, we got a `ModuleNotFoundError` complaining about `scrapinghub` module. This module is necessary if we are running our spiders in Sc…
-
### Background
Currently, the `@override_for` decorator only works for top-level domains. However, there are some cases wherein a site has multiple subdomains, usually per country _(where each has …
-
Built docker image using: docker build -t airsenal .
Then ran container using: docker run -it --rm -v airsenal_data:/tmp/ -e "FPL_TEAM_ID=xxxxxx" airsenal airsenal_run_pipeline
![image](https://us…
-
### System Information
OS: Ubuntu 18.04
Dockerfile: https://hub.docker.com/layers/scrapinghub/crawlera-headless-proxy/1.2.2/images/sha256-e0b3055dbe9f6c35320dbfa275e43fa03e54727f4d9f99f70333da779bae…
-
In the last day or two version `2022.3.15` of the python [regex](https://pypi.org/project/regex/#history) library was put out. [`dateparser`](https://pypi.org/project/dateparser/) depends on that libr…
-
Hi there. I'm looking to install this on Python 3.10, but I'm getting an error message when attempting to build locally. Details included below:
More info
```console
root@9fa684e31e01:/# python…
-
https://github.com/scrapy/scrapy/releases/tag/2.6.1
-
Hi there 👋
First, thank you very much for this great library!
I'm having the following exception while using `regex` along with the `dateparser` library:
```python
raise error("bad escape \\%s"…
-
This issue aims to discuss adding new fields in addition to the existing `html` and `url` as proposed by @gatufo:
- cookies
- headers
- status_code