-
File "/usr/lib/python2.7/site-packages/pony/orm/dbapiprovider.py", line 53, in wrap_dbapi_exceptions
except dbapi_module.OperationalError as e: raise OperationalError(e)
pony.orm.dbapiprovider.Opera…
-
大佬,你的这个代码我要换个城市,的爬取,或者爬全国的,然后岗位也是爬全部的岗位我该如何处理呢修改那部分
-
Hi,
@nramirezuy and me were debugging memory issue with one of the spiders some time ago, and it seems to be caused by ImagesPipeline + [S3FilesStore](https://github.com/scrapy/scrapy/blob/master/sc…
kmike updated
8 months ago
-
I am crawling a web that when call a image it is created on the fly, it returns 201 (create) in response.status, but this is not saved because response.status != 200.
https://github.com/scrapy/scrapy…
-
### Current Behavior
I am getting a KeyError: b'X-Scrapoxy-Proxyname' when a proxy is blacklisted
### Expected Behavior
Not getting a KeyError
### Steps to Reproduce
1. My scrapoxy proj…
-
This issue refers to the documentation [here](https://github.com/scrapinghub/portia/blob/master/docs/installation.rst)
```
You can run Portia with the command below:
docker run -i -t --rm -v :/ap…
-
I did a test newscatcher run on prod and saw a whole lot of fetch failures. Two main types. described below. I've attached text files with lists of URLs for each. The task here is to investigate _why_…
-
Hi,
did you perform any benchmarks? How is it compared to, say, PhantomJS? In particular, CPU and memory consumption.
I'm asking because running effectively over 100 parallel phantomjs instances is …
-
Várias partes do Spider das Casas Bahia estão comentados porque veio adaptado do Scrapy e precisa ser debugado.
-
像爬取的text为空,还有就是添加三元组的时候attrs和values也是空的,所以加不到三元组里