-
I'm setting the headers the following way:
```python
headers = {
'accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8',
'cache-control': 'no-cache',
...
}…
```
-
Hey, is changing just these few places enough to make it work?
'Accept':
'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
'Accept-Language': 'zh-CN,zh;q=0.9,en;q=0.8,en-US;q=0.7',
'cookie': 'your cookie'
…
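Roughly, yes — those keys just need to be merged over the base dict before the request is made. A minimal sketch (the cookie string stays a placeholder you must fill in yourself; header names are kept lowercase so the keys actually collide and override):

```python
# Base headers from the original script (truncated to the keys shown).
base_headers = {
    'accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8',
    'cache-control': 'no-cache',
}

# The suggested edits; 'your cookie' is a placeholder, not a real value.
overrides = {
    'accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
    'accept-language': 'zh-CN,zh;q=0.9,en;q=0.8,en-US;q=0.7',
    'cookie': 'your cookie',
}

# Later keys win, so the override replaces the base 'accept'.
headers = {**base_headers, **overrides}
```

Header names are case-insensitive on the wire, but dict keys are not — mixing `'Accept'` and `'accept'` would keep both entries instead of overriding.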
-
✗ scrapy run
Traceback (most recent call last):
  File "/usr/local/bin/scrapy", line 8, in
    sys.exit(execute())
  File "/Users/noname/Library/Python/3.8/lib/python…
-
Hi!
Is it possible to make deltafetch stop the Scrapy crawl when it encounters an already-visited link?
I really need this!
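As far as I know, scrapy-deltafetch only *skips* requests it has already seen; stopping the whole crawl at the first revisit would take a custom check. A minimal sketch of that idea in plain Python (no Scrapy APIs; `AlreadyVisited` stands in for something like Scrapy's `CloseSpider` exception):

```python
class AlreadyVisited(Exception):
    """Raised the first time a previously crawled link shows up again."""

def crawl(links, seen):
    """Yield unseen links, but abort the whole crawl on the first revisit."""
    for link in links:
        if link in seen:
            # deltafetch would merely skip this link; we stop everything instead
            raise AlreadyVisited(link)
        seen.add(link)
        yield link

seen = {'https://example.com/old'}   # fingerprints from a previous run
crawled, stopped = [], False
try:
    for link in crawl(['https://example.com/new', 'https://example.com/old'], seen):
        crawled.append(link)
except AlreadyVisited:
    stopped = True
```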
-
Hi there, I tried to run your Scrapy script, but there were no results.
I have also created a SQL database with a table named goods_info, but I'm still having issues. Can you help me out?
Connect to db successful…
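Hard to say without the script itself, but "connected yet no results" is often the table having been created in a different database than the one the script connects to. A quick sanity check, sketched here with sqlite3 as a stand-in (the real project may well use a different database; only the table name `goods_info` comes from the comment above):

```python
import sqlite3

conn = sqlite3.connect(':memory:')  # stand-in for the real goods database
conn.execute('CREATE TABLE goods_info (name TEXT, price REAL)')

def table_exists(conn, name):
    """Confirm the table is visible to *this* connection before inserting."""
    row = conn.execute(
        "SELECT name FROM sqlite_master WHERE type='table' AND name=?",
        (name,),
    ).fetchone()
    return row is not None
```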
-
### Description
A crawl with a feed file format `'feed-%(batch_id)s.jl'` writes feed files `feed-1.jl`, `feed-2.jl`, and so on, but overwrites those same files when restarting using the JOBDIR parame…
-
Something like Scrapy's signal mechanism?
In actual operation my project has two problems: both the proxy and the token are limited. In the middleware I need to check first whether both are valid; only when both are valid do I modify the url/headers/proxies parameters and send the request normally.
If either one is invalid, the spider has to be stopped to wait for the scheduled task's next start.
This project also uses batch_spider, and the current workaround is somewhat convoluted:
when an invalid proxy or token is detected, take all the non-1 … in the task table…
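The gating described above can be expressed independently of the middleware machinery. In this sketch `StopCrawl` stands in for whatever mechanism shuts the spider down (e.g. Scrapy's `CloseSpider`), the request is just a dict, and all names are illustrative:

```python
class StopCrawl(Exception):
    """Shut down and wait for the scheduled task to start the next run."""

def gate_request(request, proxy_ok, token_ok, proxy, token):
    """Let a request through only when both the proxy and the token are valid."""
    if not proxy_ok or not token_ok:
        # either one invalid: stop the crawl instead of sending doomed requests
        raise StopCrawl('proxy' if not proxy_ok else 'token')
    # both valid: patch the request before sending
    patched = dict(request)
    patched['proxy'] = proxy
    patched['headers'] = {**request.get('headers', {}), 'Authorization': token}
    return patched
```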
-
### Description
The change to behavior of `Spider.allowed_domains` in 2.11.2 broke several of our crawls because it does not play well with downloader middlewares that replace the original request …
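For context, the `allowed_domains` filter is essentially a suffix match on the request's host, so a middleware that swaps the original request for one on another host can suddenly fail it. A minimal re-implementation of that rule (not Scrapy's actual code) makes the behavior easy to test:

```python
from urllib.parse import urlsplit

def url_allowed(url, allowed_domains):
    """True if the URL's host equals one of the domains or is a subdomain of one."""
    host = (urlsplit(url).hostname or '').lower()
    return any(host == d or host.endswith('.' + d) for d in allowed_domains)
```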
-
```
SQLite version 3.27.2 2019-02-25 16:06:06
Enter ".help" for usage hints.
sqlite> select parse_state, count(*) from RECIPES_LIST group by parse_state;
0|11448
1|4712
2|1885
3|655
sqlite>
```
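The same tally is easy to reproduce from Python if you want to act on it (say, retry rows stuck in a given state). Table and column names come from the session above; the rows here are illustrative, not the real 18,700-row table:

```python
import sqlite3

conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE RECIPES_LIST (url TEXT, parse_state INTEGER)')
# A few illustrative rows; the real table holds 18,700 of them.
conn.executemany(
    'INSERT INTO RECIPES_LIST (url, parse_state) VALUES (?, ?)',
    [('u', s) for s in (0, 0, 0, 1, 1, 2)],
)
# Same GROUP BY as in the sqlite shell, collected into a dict.
counts = dict(conn.execute(
    'SELECT parse_state, COUNT(*) FROM RECIPES_LIST GROUP BY parse_state'
))
```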
-
### Description
When you hit `SCRAPER_SLOT_MAX_ACTIVE_SIZE`, requests silently stop being processed, with no warning.
If you are deferring items in a pipeline that depend on other requests fin…
djay updated
3 months ago
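A toy model of the slot accounting makes the stall easy to see (this is a simplification, not Scrapy's actual scraper code; the default for `SCRAPER_SLOT_MAX_ACTIVE_SIZE` is 5,000,000 bytes):

```python
class SlotSketch:
    """Simplified model of the scraper slot's backpressure accounting."""

    def __init__(self, max_active_size=5_000_000):
        self.max_active_size = max_active_size
        self.active_size = 0  # total body bytes of responses still being processed

    def add_response(self, body_size):
        self.active_size += body_size

    def finish_response(self, body_size):
        self.active_size -= body_size

    def needs_backout(self):
        # While this is True, the engine quietly stops feeding new requests.
        return self.active_size >= self.max_active_size
```

If a pipeline defers its items on requests that can only run once the slot drains, `finish_response` never happens for them, `needs_backout()` stays true, and the crawl deadlocks without a single log message.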