-
getcookie 函数无法正确获取,获取的url不支持https://github.com/owner888/phpspider/issues/new多路径网址。
$spider->on_start = function($phpspider)
{
$cookie = requests::get_cookie("SUB", "s.weibo.com");
// 把C…
-
2022-06-01 21:15:31 [scrapy.core.scraper] ERROR: Spider error processing (referer: None)
Traceback (most recent call last):
File "C:\Users\xxx\anaconda3\envs\py372\lib\site-packages\scrapy\utils\…
-
Traceback (most recent call last):
File "D:\anoconda\envs\python36\lib\runpy.py", line 193, in _run_module_as_main
"__main__", mod_spec) …
-
2022-02-16 14:06:22 [scrapy.core.scraper] ERROR: Spider error processing (referer: None)
Traceback (most recent call last):
File "/home/vsi/.local/lib/python3.7/site-packages/scrapy/utils/defer.p…
-
D:\weibo\weibo-search-master>scrapy crawl search
Traceback (most recent call last):
File "D:\Programs\Python\Python38\lib\runpy.py", line 193, in _run_module_as_main
return _run_code(code, ma…
-
D:\weibo\weibo-search-master>scrapy crawl search
Traceback (most recent call last):
File "D:\Programs\Python\Python38\lib\runpy.py", line 193, in _run_module_as_main
return _run_code(code, ma…
-
(rectflow) amax@amax:/data/lu/yyl/weibo-search/weibo-search-master$ scrapy crawl search -s JOBDIR=crawls/search -L INFO
[1]+ 已完成 scrapy crawl search -s JOBDIR=crawls/search
2023-04-08…
-
大大你好,我每次爬取一天的数据,第一次运行了大约7个小时,是ok的,但之后每次大约四个小时之后就会报错:
2022-01-15 14:09:36 [scrapy.downloadermiddlewares.retry] ERROR: Gave up retrying (failed 3 times): []
2022-01-15 14:09:36 [scrapy.core.scraper] …
-
Traceback (most recent call last):
File "C:/Users/June/Desktop/sina-weibo-crawler-master/spider.py", line 12, in
print(crawler.crawl(url = 'http://weibo.cn/yaochen'))
File "C:\Users\June\D…
-
运行报错:
UnicodeEncodeError: 'gbk' codec can't encode character '\U0001f525' in position 400: illegal multibyte sequence
处理字符时遇到了 Unicode 编码问题,'gbk' 编码不支持。字符 '\U0001f525' 是🔥表情符号。