-
`process_item` is called in the order of the pipeline classes listed in the [`ITEM_PIPELINES`](https://doc.scrapy.org/en/latest/topics/settings.html#std:setting-ITEM_PIPELINES) setting. But `clo…
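For reference, a minimal sketch of how that ordering works (the project and pipeline class paths below are hypothetical): pipelines run in ascending order of the integer value assigned in the setting.

```python
# Hypothetical pipeline paths, for illustration only.
# Lower values run first: CleanPipeline.process_item is called on each
# item before StorePipeline.process_item.
ITEM_PIPELINES = {
    "myproject.pipelines.CleanPipeline": 100,
    "myproject.pipelines.StorePipeline": 300,
}

# The engine effectively sorts the classes by value before chaining them:
call_order = [path for path, _ in sorted(ITEM_PIPELINES.items(), key=lambda kv: kv[1])]
print(call_order)
```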
-
Hi,
If an exception is raised while parsing a response in Scrapy, the request remains marked as `QUEUED` and no error is logged on the frontier. …
-
I'm experiencing difficulties in accessing a ScrapyRT service running on specific ports within a Kubernetes pod. My setup includes a Kubernetes cluster with a pod running a Scrapy application, which u…
-
I have several failover IPs that are correctly configured (they work with wget and curl), and I would like to bind to them when using Scrapy. I use the `bindaddress` meta key to achieve this, but the public IP is …
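For context, this is roughly how the `bindaddress` meta key is attached to a request; the helper name and the IP below are placeholders, not from the original setup. Scrapy passes the value through to Twisted, which takes a `(host, port)` tuple for the local address (port 0 means any free port).

```python
# "203.0.113.10" is a placeholder (TEST-NET range); substitute a failover IP.
def make_request_kwargs(url, source_ip):
    """Hypothetical helper: build kwargs for scrapy.Request(**kwargs) so the
    downloader binds the outgoing socket to source_ip."""
    return {"url": url, "meta": {"bindaddress": (source_ip, 0)}}

kwargs = make_request_kwargs("https://example.com", "203.0.113.10")
```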
-
I hit an interesting failure while writing a unit test for the `process_spider_exception` method of a spider middleware:
In my project, this method returns an iterable (list) of request objects, wh…
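A minimal sketch of that pattern (the middleware name and retry behavior are illustrative, not the project's actual code): `process_spider_exception` may return an iterable of `Request` objects, which Scrapy then processes as if the callback had yielded them.

```python
class RetryOnParseErrorMiddleware:
    """Illustrative spider middleware, not the issue author's code."""

    def process_spider_exception(self, response, exception, spider):
        # Returning an iterable of Request objects (rather than None)
        # tells Scrapy to continue the chain with these requests.
        return [response.request.replace(dont_filter=True)]
```

In a unit test, the method can be called directly with stub response/request objects and the returned list asserted on, without running a full crawl.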
-
This is a great dataset by the way, and we wanted to use it for a group project we were doing. Unfortunately, we are running into some errors while following the steps. When I started ./run.sh, th…
-
```
[root@localhost woaidu_crawler]# scrapy crawl woaidu
Unhandled error in Deferred:
Unhandled Error
Traceback (most recent call last):
File "/usr/lib64/python2.7/site-packages/scrapy/commands/crawl.py…
```
-
Good day. Let's say we have a million requests in a slot, and the consumer sets either `HCF_CONSUMER_MAX_REQUESTS = 15000` or `HCF_CONSUMER_MAX_BATCHES = 150`, or it just closes itself after N h…
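For readers unfamiliar with these hcf-backend settings, a sketch of the two stop conditions being compared (the values are the ones from the question; only one condition would normally be set):

```python
# hcf-backend consumer stop conditions (sketch, values from the question):
HCF_CONSUMER_MAX_REQUESTS = 15000  # stop after consuming this many requests
HCF_CONSUMER_MAX_BATCHES = 150     # or stop after consuming this many batches
```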
-
### Description
I have been trying to use Scrapy's CrawlSpider to crawl listings from a website. The problem is that the data comes from `XMLHttpRequest` calls. So I have been using `[Puppeteer As A Service…
-
Hi there,
I am working on Frontera these days, and Frontera is a great tool for cluster crawling!
But I still find some things that are not easy to understand or figure out, because of the lack…