NanmiCoder / MediaCrawler

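Xiaohongshu note | comment crawler, Douyin video | comment crawler, Kuaishou video | comment crawler, Bilibili video | comment crawler, Weibo post | comment crawler, Baidu Tieba post | Baidu Tieba comment-reply crawler, Zhihu Q&A article | comment crawler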
Other
16.47k stars 5.26k forks

Fetched content is garbled #364

Closed 807660937 closed 1 month ago

807660937 commented 1 month ago

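Why do the title, desc, and content fields crawled from xhs all come back as runs of underscores like `__`? Is there any way to get the normal content?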

[{"note_id": "65c73740000000002c036d87", "type": "video", "title": "____________________", "desc": "__________________________________________#________[____]#", "video_url": "http://sns-video-bd.xhscdn.com/spectrum/1040g0jg30v07kte3l8005of1f6g8dbdq8iiutqg", "time": 1707554624000, "last_update_time": 1707558305000, "user_id": "61e179a0000000002102adba", "nickname": "________", "avatar": "https://sns-avatar-qc.xhscdn.com/avatar/6297376c04470a00017e1f2c.jpg", "liked_count": "732969", "collected_count": "199672", "comment_count": "13019", "share_count": "10585", "ip_location": "", "image_list": "http://sns-webpic-qc.xhscdn.com/202407301226/704eb449be39ee15d18b9d442d9b226f/spectrum/1040g0k030v07nep85g005of1f6g8dbdq8jkhpc8!nd_dft_wlteh_webp_3", "tag_list": "________", "last_modify_ts": 1722313577987, "note_url": "https://www.xiaohongshu.com/explore/65c73740000000002c036d87"}, {"note_id": "65cc8f5c00000000070267e9", "type": "video", "title": "________________________~", "desc": "#________[____]# ________________
~", "video_url": "http://sns-video-bd.xhscdn.com/pre_post/1040g2t030v5em9e85g004a4at4856bhjkud1s68", "time": 1707904860000, "last_update_time": 1707904861000, "user_id": "5ac7505311be102ccdbd2e33", "nickname": "____", "avatar": "https://sns-avatar-qc.xhscdn.com/avatar/5c7d1a340355710001eee534.jpg", "liked_count": "547043", "collected_count": "88335", "comment_count": "9132", "share_count": "3259", "ip_location": "", "image_list": "http://sns-webpic-qc.xhscdn.com/202407301226/4d0000152b626841e757dfc7d07fd450/1040g2sg30v5enr0c56004a4at4856bhj95hjicg!nd_dft_wlteh_webp_3", "tag_list": "________", "last_modify_ts": 1722313577989, "note_url": "https://www.xiaohongshu.com/explore/65cc8f5c00000000070267e9"}, {"note_id": "65e2b145000000000102ac93", "type": "video", "title": "____________________________", "desc": "#________[____]#__#____[____]#", "video_url": "http://sns-video-bd.xhscdn.com/spectrum/1040g0jg30vr2avfplq005of1f6g8dbdqhg6eqi0", "time": 1709355333000, "last_update_time": 1709355334000, "user_id": "61e179a0000000002102adba", "nickname": "________", "avatar": "https://sns-avatar-qc.xhscdn.com/avatar/6297376c04470a00017e1f2c.jpg", "liked_count": "340804", "collected_count": "41903", "comment_count": "2001", "share_count": "1325", "ip_location": "", "image_list": "http://sns-webpic-qc.xhscdn.com/202407301226/64b5a156cd41b2582035bc2509518dd1/spectrum/1040g0k030vr2c9dt5u005of1f6g8dbdqoqsqif0!nd_dft_wlteh_webp_3", "tag_list": "________,____", "last_modify_ts": 1722313577990, "note_url": "https://www.xiaohongshu.com/explore/65e2b145000000000102ac93"}]
"title": "________________________________________", "desc": "[____R]_#______[____]#__#____________[____]#__#____________[____]#__#__________[____]#__#2024__________[__
__]#", "video_url": "http://sns-video-bd.xhscdn.com/spectrum/1040g0jg30ulkicjg5a005nqcl63g8kt0s3jnuk8", "time": 1706843463000, "last_update_time": 1706843463000, "user_id": "5f4ca98700000000010053a0", "nickname": "__________", "avatar": "https://sns-avatar-qc.xhscdn.com/avatar/1040g2jo3158vbofdgm605nqcl63g8kt0l2s3710", "liked_count": "48185", "collected_count": "3465", "comment_count": "294", "share_count": "359", "ip_location": "", "image_list": "http://sns-webpic-qc.xhscdn.com/202407301033/e48b0c3a95756a963672b14acfa079df/spectrum/1040g0k030ulkk14s5c005nqcl63g8kt0e0r1qgg!nd_dft_wlteh_webp_3", "tag_list": "______,____________,____________,__________,2024__________", "last_modify_ts": 1722306826055, "note_url": "https://www.xiaohongshu.com/explore/65bc5d47000000002c014720"}, {"note_id": "65cb8403000000002d000aff", "type": "video", "title": "________________________________", "desc": "______________________________________\n\t\n____________________________________\n\t\n______________________\n\t\n________
____________\n\t\n__________________________\n\t\n#________[____]# #____[____]# #________[____]# #________100__[____]# #______[____]#", "video_url": "http://sns-video-bd.xhscdn.com/pre_post/1040g2t030v4djk45l6605no6stl0btgv9q5sn68", "time": 1707836419000, "last_update_time": 1707836420000, "user_id": "5f06e76a000000000101f61f", "nickname": "__________", "avatar": "https://sns-avatar-qc.xhscdn.com/avatar/1040g2jo30v96a9h75c505no6stl0btgvh6o38a0", "liked_count": "45157", "collected_count": "10006", "comment_count": "1183", "share_count": "879", "ip_location": "", "image_list": "http://sns-webpic-qc.xhscdn.com/202407301033/1c29225f829bb6c3a21b0f66b12af68a/1040g2sg30v4dpbnm58605no6stl0btgva4qd85g!nd_dft_wlteh_webp_3", "tag_list": "________,____,________,________100__,______", "last_modify_ts": 1722306826057, "note_url": "https://www.xiaohongshu.com/explore/65cb8403000000002d000aff"}, {"note_id": "65cadc13000000002c02b366", "type": "video", "title": "_______________________________________________#____________
[____]#_#____[____]#__#______[____]#", "desc": "_______________________________________________#____________[____]#_#____[____]#__#______[____]#", "video_url": "http://sns-video-bd.xhscdn.com/spectrum/1040g0jg30v3pibes5g005o5f0s0g90p1roi3sso", "time": 1707793427000, "last_update_time": 1707793428000, "user_id": "60af07010000000001008321", "nickname": "________", "avatar": "https://sns-avatar-qc.xhscdn.com/avatar/6465ca9a2a8197c5fd9cbfe5.jpg", "liked_count": "45005", "collected_count": "2684", "comment_count": "548", "share_count": "128", "ip_location": "", "image_list": "http://sns-webpic-qc.xhscdn.com/202407301033/0621de9275043ff2c49be29a20092d95/spectrum/1040g0k030v3pjfn4li005o5f0s0g90p174rje8g!nd_dft_wlteh_webp_3", "tag_list": "____________,____,______", "last_modify_ts": 1722306826059, "note_url": "https://www.xiaohongshu.com/explore/65cadc13000000002c02b366"}
{"comment_id": "65c98ebb000000001a01e8b2", "create_time": 1707708091000, "ip_location": null, "note_id": "65c73740000000002c036d87", "content": "@momo [____R][____R][____R]", "user_id": "5afad7eb4eacab5d8ab94f4b", "nickname": "5our", "avatar": "https://sns-avatar-qc.xhscdn.com/avatar/1040g2jo31558p1u71a004a5blfbumjqboachne8?imageView2/2/w/120/format/jpg", "sub_comment_count": "0", "pictures": "", "parent_comment_id": 0, "last_modify_ts": 1722312186467}, {"comment_id": "65d1dabd0000000009014164", "create_time": 1708251837000, "ip_location": null, "note_id": "65d18b750000000007004962", "content": "________________________[____R]", "user_id": "5897fb035e87e70c947132d2", "nickname": "Aquarius", "avatar": "https://sns-avatar-qc.xhscdn.com/avatar/5897fb035e87e70c947132d2.jpg?imageView2/2/w/120/format/jpg", "sub_comment_count": 0, "pictures": "", "parent_comment_id": "65d1d169000000000501ab7d", "last_modify_ts": 1722312187310}
NanmiCoder commented 1 month ago

Try switching to a different xhs account and see whether the problem still occurs.
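To check whether the problem is still there after switching accounts, one option is to scan the saved JSON for text fields that contain underscores but no Chinese characters. The following is only a rough sketch, not part of MediaCrawler: the field names are taken from the output pasted above, the helper names (`looks_masked`, `masked_fields`, `summarize`) are made up for illustration, and the "underscores with no CJK characters" rule is an assumed heuristic that can misfire on legitimately non-Chinese text.

```python
import re

# Text fields seen in the note and comment records pasted in this issue.
TEXT_FIELDS = ("title", "desc", "nickname", "tag_list", "content")

# Basic CJK Unified Ideographs range; a masked field contains none of these.
CJK = re.compile(r"[\u4e00-\u9fff]")


def looks_masked(value):
    """Heuristic: a string with a run of underscores and no CJK characters."""
    return isinstance(value, str) and "__" in value and not CJK.search(value)


def masked_fields(record):
    """Return the text fields of one crawled record that look masked."""
    return [name for name in TEXT_FIELDS if looks_masked(record.get(name))]


def summarize(records):
    """Map each affected record's id to the list of fields that look masked."""
    report = {}
    for record in records:
        bad = masked_fields(record)
        if bad:
            # Comment records carry both ids, so prefer comment_id.
            key = record.get("comment_id") or record.get("note_id")
            report[key] = bad
    return report
```

Load the exported file with `json.load` and pass the resulting list to `summarize`. An empty result after switching accounts would suggest the new account gets readable text.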