scrapy-spider Search Results

scrapy/scrapy #6437

process_spider_exception not executed for exceptions in errb…

Extension of https://github.com/scrapy/scrapy/issues/1015 - spider exceptions don't trigger `process_spider_exception` if they're called from an `errback` method. ``` import logging from scra…

mohmad-null updated 1 week ago

apify/crawlee-python #295

HTTP API for Spider

`Scrapy` offers an HTTP API through a third-party library called `ScrapyRT`, which exposes an HTTP API for spiders. By sending a request to `ScrapyRT` with the spider name and URL, you receive the ite…

Ehsan-U updated 1 week ago

alltheplaces/alltheplaces #8693

AttributeError: 'NoneType' object has no attribute 'replace'…

``` 2024-06-22 22:27:27 [scrapy.core.scraper] ERROR: Spider error processing (referer: None) Traceback (most recent call last): File "/home/ubuntu/.pyenv/versions/3.11.9/lib/python3.11/site-pack…

matkoniecz updated 3 days ago

diegov/searchbox #36

UberSpider and decoupling spiders from Scrapy

The only parts of Scrapy that we take advantage of are the scheduler and the downloader. Its management of crawlers and spiders doesn't add anything to our usecase, and the abstractions provided add c…

diegov updated 1 month ago

scrapy/scrapy #6425

Exception in FeedExporter when Using Path Objects with Stora…

### Description According to the [documentation](https://docs.scrapy.org/en/latest/topics/feed-exports.html#feeds), the `FEEDS` dict accepts `Path` objects as keys: > [...] dictionary in whi…

kkmarv updated 2 weeks ago

manuelandersen/padel-scrapy #8

workflows runs on

Right now the Scrapy Spider Workflow #7 gets trigger on every push (pr and merge) it should only work with pr's.

manuelandersen updated 3 weeks ago

rmax/scrapy-redis #285

[Question] Fetch request url from redis fail

# Description If i insert start url to redis before run scrapy, is successful. But if i run scrapy first and insert url, listen url will get fail info: ``` 2023-08-13 17:11:59 [scrapy.utils.…

KokoTa updated 3 weeks ago

scrapy/scrapy #6410

Cookies with domain localhost & IPV4 addresses won't get set

### Description When setting cookies on a request, you can specify a domain. If you set the domain to "localhost" or any IPV4 address, it won't get set on requests for "localhost"/the IPV4 address.…

pvanderlinden updated 2 weeks ago

scrapy/scrapy #6433

core.engine/Signal handler polluting log

### Description The `OffsiteMiddleware` logs a single message for each domain filtered. Great! But then the `core.engine` logs a message for every single url filtered by the OffsiteMiddleware. (L…

djuntsu updated 2 weeks ago

dataabc/weibo-search #479

请问一运行就会有这个错误

C:\Users\33721\Desktop\weibo-search-master>scrapy crawl search -s JOBDIR=crawls/search 2024-05-18 12:51:09 [scrapy.core.scraper] ERROR: Spider error processing (referer: https://s.weibo.com/weibo?…

ArcherChloe updated 1 month ago

1000+ results for scrapy-spider

1000+ results
for scrapy-spider