-
使用Github抓取博客链接、使用mongodb存储数据,在抓取阶段出现问题
`https://blog.akimio.top/links/`是用的是`butterfly`魔改主题(solitude)[https://github.com/everfu/hexo-theme-solitude],之前是可以正常抓取的,**一开始我怀疑是主题的问题,找了一个原版butterfly主题的友链,还是出现…
-
[Osmose](https://wiki.openstreetmap.org/wiki/Osmose) has some spiders written in Python, mostly for data feeds in France. However, other than AllThePlaces, Osmose doesn’t just fetch the data and conve…
-
在我运行python data_preprocess.py命令时,得到了如下报错:
Traceback (most recent call last):
File "data_preprocess.py", line 131, in
schema_linking_producer(spider_dev, spider_train, spider_table, spider_…
-
```
2024-06-22 22:27:27 [scrapy.core.scraper] ERROR: Spider error processing (referer: None)
Traceback (most recent call last):
File "/home/ubuntu/.pyenv/versions/3.11.9/lib/python3.11/site-pack…
-
-
I've been debugging this problem for a while, it's intermittent making it harder to reproduce.
When running some jobs with `scrapy-playwright` the jobs get's abruptly terminated, if you observe th…
-
The C-based version of SPIDER is being deprecated. There is a new version written in Python being developed (https://github.com/ExPlanetology/aragog), which uses SciPy rather than PETSc and thereby si…
-
为了更好的解决问题,请认真回答下面的问题。等到问题解决,请及时关闭本issue。
- 问:请您指明哪个版本运行出错(github版/PyPi版/全部)?
答:全部
- 问:您使用的是否是最新的程序(是/否)?
答:是
(还没安装,所以中间问题我删了)
- 问:如果方便,请您描述出错详情,最好附上错误提示。
答:
**我的python是3.12.2版本…
-
When I was working with the SDK, I found that the SDK was not very convenient for schedules and deployment of multiple spiders, so I wondered if it could be designed to look like the following
…
-
`
Exception ignored in:
Traceback (most recent call last):
File "d:\work\python\spider\login.py", line 20, in __del__
File "C:\Users\anerg\anaconda3\envs\DrissionPage\Lib\site-packages\Drissi…