-
Hello,
Crawlers won't shutdown gracefully since CrawlerProcess' `_start_crawler` method pops crawlers off the `self.crawlers` list, which is where the `stop` method looks for crawlers to stop.
Steps…
demji updated
2 months ago
-
_This is a bit of a [WIP] issue because there are more details to be added when I have the time_
environ.py has the code responsible for deleting logfiles.
``` python
def _get_file(self, message…
-
When the chrome is killed or crash, the context will continue create newpage and throw exception:
```log
2023-01-31 19:29:51 [scrapy.core.scraper] ERROR: Error downloading
Traceback (most recent c…
-
### Brand name
UBB / ОББ
### Wikidata ID
Q7887555
### Store finder url(s)
https://www.ubb.bg/offices/pins?city_id=&srch_offices=&_=1694684624025
`pin_type` is used to diffrentiate between `off…
-
I seem to be getting the following issue but I am unsure why the argument passed is invalid?
Model Name: MacBook Pro
Model Identifier: Mac14,7
Model Number: MNEJ3LL/A
Chip: Apple M2
Total Numbe…
-
### Environment
- python 3.10
- OS: macOS 14.1.1, Ubuntu 22.04 LTS
- playwright Version 1.42.0
When PLAYWRIGHT_BROWSER_TYPE set as 'chromium' (or default) under macOS, , there appears to be a me…
-
### Brand name
Mikucha
Thai specialty teas and drinks
### Wikidata ID
Q118640408
https://www.wikidata.org/wiki/Q118640408
https://www.wikidata.org/wiki/Special:EntityData/Q118640408.json…
-
Bulgarian Posts have released a new API which shows the same data in machine readable format. Even the opening hours are in a sane format.
https://api.bgpost.bg/nomenclature/Office/details/all
-
### Brand name
Praktiker
### Wikidata ID
Q110399491
### Store finder url(s)
https://api.praktiker.bg/videoluxcommercewebservices/v2/praktiker/mapbox/customerpreferedstore
-
When running spiders that do nothing at all, the sqlite based poller uses all cpu just reading scheduled tasks. It would be good to have a plug and play alternative queues like redis.