-
I try to build proxy settings of splash. I assign Tor or Polipo port address to `set_proxy`, but it doesn't work. I get 504 error:
function main(splash)
local host = "localhos…
-
I found an example with which I can retreive only the specified meta from job items:
https://doc.scrapinghub.com/api/items.html?highlight=metadata#examples
This could be handy to parallelize downl…
-
Trying to build a custom docker-image with:
```
1. scrapy startproject test
2. scrapy genspider testSpider
3. shub image init
4. docker build . -t customnamehere
```
Dockerfile:
```
FROM sc…
-
Getting this error on scrapinghub. Works okay locally. Any ideas?
```
Traceback (most recent call last):
File "/usr/local/lib/python3.6/site-packages/scrapy/utils/defer.py", line 102, in iter_e…
-
It would be nice if there was an additional option to prefer the current date if available over the future or past one when parsing relative dates. This is primarily an issue when parsing dates in new…
-
Hi. I would like to work on the 'Add Grad-CAM support' project (http://gsoc2019.scrapinghub.com/ideas/#add-grad-cam-support) as a GSoC student. As agreed, I will try to familiarize myself with ELI5 by…
-
I'm using [AnyProxy](https://anyproxy.io/en/) and i can't use proxy on https endpoints with Splash, the authorization headers are not sended on **CONNECT** method, so i received **407 Proxy Authentica…
-
Every now and then the Travis build fails on [`tests.test_crawler.CrawlerRunnerTestCase.test_deprecated_attribute_spiders`](https://github.com/scrapy/scrapy/blob/1.7.3/tests/test_crawler.py#L173), I'm…
-
Fill in the missing docs for the following settings:
```
SPIDERMON_SPIDER_OPEN_EXPRESSION_MONITORS
SPIDERMON_SPIDER_CLOSE_EXPRESSION_MONITORS
SPIDERMON_EXPRESSIONS_MONITOR_CLASS
```
https://…
-
https://github.com/uBlockOrigin/uAssets/blob/master/filters/unbreak.txt contains extra rules which unbreak easylist rules.