-
I think we need a way to propagate the exit code from the Spider.
Based on: #1231
-
I tried to crawl a post with 2700 comments. But I can only run it to page 60
The post link:
m.facebook.com/story.php?story_fbid=2226458920929531&id=2226454927596597&**`p=60`**&av=100036884506828&e…
-
Hi, thank you for making this scraper! When running it on the bible translation with code "1805" however, I encounter the following issue:
```
Traceback (most recent call last):
File "/opt/home…
-
## Summary
It should be possible to update some of the jobs settings while they are running. This would be specially useful for the settings related to crawling speed.
## Motivation
I have e…
-
2023-03-12 18:15:15 [twisted] CRITICAL:
Traceback (most recent call last):
File "F:\python\anaconda\lib\site-packages\twisted\internet\defer.py", line 1697, in _inlineCallbacks
result = conte…
-
This issue has several components, all related to TimeoutErrors.
Scrapy 1.2.1
a) If a TimeoutError is raised, by default it will print the entire exception to the logger. This isn't in keeping wit…
-
I just downloaded logs for one splash application and it is really huge file. This is because we do splash:set_content(response.body) in one spider and this means that whole huge response body is dump…
-
`Caplog` start capturing, but in a specific place capturing is stopped.
It reproduces in one test and does not reproduce in the rest.
Additional I need `caplog.set_level('DEBUG')` for this test (spe…
-
https://biaggis.com/locations/
-
I am using anticaptcha api for solve captcha.I am getting success response from api see below
`Solved CAPTCHA: 03AFcWeA6MMcUMSkRKvTNeuenNTD8riCIwZ9UoQfciYaLV5BFxQIMnH8cFS6xElR-DGcYTedEVBjQbG9oRAHOm…