NanmiCoder / MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
Other
16.47k stars 5.26k forks source link

b站评论爬取,数据保存为json只有一条,保存为csv能正常爬取 #390

Closed mouzzmou closed 3 weeks ago

mouzzmou commented 1 month ago

求大佬解答,我把账号,ip都换了一篇,发现数据保存类型改变为json,就爬取失败,改成csv又是正常爬取,我想要词云图。 (venv) D:\MediaCrawler-main>python main.py --platform bili --lt qrcode --type detail 2024-08-13 15:53:30 MediaCrawler INFO (core.py:303) - [BilibiliCrawler.launch_browser] Begin create browser context ... 2024-08-13 15:53:33 MediaCrawler INFO (core.py:265) - [BilibiliCrawler.create_bilibili_client] Begin create bilibili API client ... 2024-08-13 15:53:33 MediaCrawler INFO (client.py:98) - [BilibiliClient.pong] Begin pong bilibili... 2024-08-13 15:53:34 MediaCrawler INFO (client.py:104) - [BilibiliClient.pong] Use cache login state get web interface successfull! 2024-08-13 15:53:35 MediaCrawler INFO (init.py:52) - [store.bilibili.update_bilibili_video] bilibili video id:931463745, title:破亿纪念!【猛男版】新宝岛 4K高清重置加强版 2024-08-13 15:53:35 MediaCrawler INFO (core.py:336) - [BilibiliCrawler.get_bilibili_video] Crawling image mode is not enabled 2024-08-13 15:53:35 MediaCrawler INFO (core.py:146) - [BilibiliCrawler.batch_get_video_comments] video ids:[931463745] 2024-08-13 15:53:35 MediaCrawler INFO (core.py:165) - [BilibiliCrawler.get_comments] begin get video_id: 931463745 comments ... 2024-08-13 15:53:36 MediaCrawler INFO (init.py:99) - [store.bilibili.update_bilibili_video_comment] Bilibili video comment: 4851809144, content: 1.14版本主要修改: .增强了画质 .移除了领舞的项链 .增长了领舞的头发长度 .胖哥 头发颜色更变 .别的不知道了

(venv) D:\MediaCrawler-main>python main.py --platform bili --lt qrcode --type detail

NanmiCoder commented 4 weeks ago

错误日志是啥,我这边测试了没有发现有问题

mouzzmou commented 3 weeks ago

直接结束了的,没报错 443

NanmiCoder commented 3 weeks ago

未能复现

NanmiCoder commented 3 weeks ago

还是未能复现该问题