sqzw-x / mdcx

Movie metadata scraper
GNU General Public License v3.0
1.45k stars 219 forks source link

madouqu 刮削失败 #21

Closed wuaishanshan closed 8 months ago

wuaishanshan commented 8 months ago
          6月份之后的版本原作者更换了部分网站的请求库为 `curl_cffi`,可能是这一改动导致的。

以下网站使用该库:airav.py airav_cc.py avsex.py freejavbt.py iqqtv.py javdb.py javlibrary.py madouqu.py mmtv.py 以及获取亚马逊图片,可以看看其它网站是否同样报错 不排除是第三方库的问题,但是无法稳定复现

Originally posted by @sqzw-x in https://github.com/sqzw-x/mdcx/issues/14#issuecomment-1872503116 不是很懂 ⏰ 09:10:13 🍯 你可以点击左下角的图标来 显示 / 隐藏 请求信息面板! ⏰ 09:10:13 🔎 请求 https://api.github.com/repos/anyabc/something/releases/latest ⏰ 09:10:14 [1/3] https://api.github.com/repos/anyabc/something/releases/latest Error: HTTPSConnectionPool(host='api.github.com', port=443): Max retries exceeded with url: /repos/anyabc/something/releases/latest (Caused by ProxyError('Cannot connect to proxy.', FileNotFoundError(2, 'No such file or directory'))) ⏰ 09:10:14 [2/3] https://api.github.com/repos/anyabc/something/releases/latest Error: HTTPSConnectionPool(host='api.github.com', port=443): Max retries exceeded with url: /repos/anyabc/something/releases/latest (Caused by ProxyError('Cannot connect to proxy.', FileNotFoundError(2, 'No such file or directory'))) ⏰ 09:10:15 [3/3] https://api.github.com/repos/anyabc/something/releases/latest Error: HTTPSConnectionPool(host='api.github.com', port=443): Max retries exceeded with url: /repos/anyabc/something/releases/latest (Caused by ProxyError('Cannot connect to proxy.', FileNotFoundError(2, 'No such file or directory'))) ⏰ 09:10:15 🔴 请求失败!https://api.github.com/repos/anyabc/something/releases/latest Error: HTTPSConnectionPool(host='api.github.com', port=443): Max retries exceeded with url: /repos/anyabc/something/releases/latest (Caused by ProxyError('Cannot connect to proxy.', FileNotFoundError(2, 'No such file or directory'))) ⏰ 09:16:28 🔎 遍历待刮削目录.... ⏰ 09:16:28 ✅ Found (170)! Skip successfully scraped (0) repeat softlink (0)! (0s)... Still searching, please wait...   ⏰ 09:16:28 🎉 Done!!! Found (193)! Skip successfully scraped (0) repeat softlink (0)! (0s)   ⏰ 09:16:28 🔎 Scraper请求 https://madouqu.com/?s=XXXX-2415 ⏰ 09:16:30 ✅ Scraper成功 https://madouqu.com/?s=XXXX-2415 ⏰ 09:16:37 🔎 Scraper请求 https://madouqu.com/?s=XXXX-2415 ⏰ 09:16:38 ✅ Scraper成功 https://madouqu.com/?s=XXXX-2415 ⏰ 09:16:46 🔎 Scraper请求 https://madouqu.com/?s=XXXX-2415 ⏰ 09:16:47 ✅ Scraper成功 https://madouqu.com/?s=XXXX-2415 ⏰ 09:16:49 ⛔️ 正在停止正在运行的任务线程 (21) ... ⏰ 09:16:49 正在停止线程: 1/21 MDCx-Pool_0 ... ⏰ 09:16:49 正在停止线程: 2/21 MDCx-Pool_1 ... ⏰ 09:16:49 正在停止线程: 3/21 MDCx-Pool_2 ... ⏰ 09:16:49 正在停止线程: 4/21 MDCx-Pool_3 ... ⏰ 09:16:49 正在停止线程: 5/21 MDCx-Pool_4 ... ⏰ 09:16:49 正在停止线程: 6/21 MDCx-Pool_5 ... ⏰ 09:16:49 正在停止线程: 7/21 MDCx-Pool_6 ... ⏰ 09:16:49 正在停止线程: 8/21 MDCx-Pool_7 ... ⏰ 09:16:49 正在停止线程: 9/21 MDCx-Pool_8 ... ⏰ 09:16:49 正在停止线程: 10/21 MDCx-Pool_9 ... ⏰ 09:16:49 正在停止线程: 11/21 MDCx-Pool_10 ... ⏰ 09:16:49 正在停止线程: 12/21 MDCx-Pool_11 ... ⏰ 09:16:49 正在停止线程: 13/21 MDCx-Pool_12 ... ⏰ 09:16:49 正在停止线程: 14/21 MDCx-Pool_13 ... ⏰ 09:16:49 正在停止线程: 15/21 MDCx-Pool_14 ... ⏰ 09:16:49 正在停止线程: 16/21 MDCx-Pool_15 ... ⏰ 09:16:49 正在停止线程: 17/21 MDCx-Pool_16 ... ⏰ 09:16:49 正在停止线程: 18/21 MDCx-Pool_17 ... ⏰ 09:16:49 正在停止线程: 19/21 MDCx-Pool_18 ... ⏰ 09:16:49 正在停止线程: 20/21 MDCx-Pool_19 ... ⏰ 09:16:49 正在停止线程: 21/21 MDCx-Scrape-Thread ... ⏰ 09:16:49 线程正在停止中,请稍后... 🍯 停止时间与线程数量及线程正在执行的任务有关,比如正在执行网络请求、文件下载等IO操作时,需要等待其释放资源。。。

⏰ 09:16:50 所有线程已停止!!!(1s) ⛔️ 刮削已手动停止! 这是6月的版本连接正常,下面是12月的新版 ⏰ 09:17:02 🍯 你可以点击左下角的图标来 显示 / 隐藏 请求信息面板! ⏰ 09:17:02 🔎 请求 https://api.github.com/repos/sqzw-x/mdcx/releases/latest ⏰ 09:17:05 ✅ 成功 https://api.github.com/repos/sqzw-x/mdcx/releases/latest ⏰ 09:17:05 🔎 请求 https://javbus.com/FSDSS-660 ⏰ 09:17:09 ✅ 成功 https://javbus.com/FSDSS-660 ⏰ 09:17:13 🔎 遍历待刮削目录.... ⏰ 09:17:13 ✅ Found (170)! Skip successfully scraped (0) repeat softlink (0)! (0s)... Still searching, please wait...   ⏰ 09:17:13 🎉 Done!!! Found (193)! Skip successfully scraped (0) repeat softlink (0)! (1s)   ⏰ 09:17:13 🔎 请求 https://madouqu.com/?s=XXXX-2415 ⏰ 09:17:23 🔎 请求 https://madouqu.com/?s=XXXX-2415 ⏰ 09:17:33 🔎 请求 https://madouqu.com/?s=XXXX-2415 ⏰ 09:17:43 🔎 请求 https://madouqu.com/?s=XXXX-2415 ⏰ 09:17:43 [1/3] https://madouqu.com/?s=XXXX-2415 Error: Failed to perform, ErrCode: 28, Reason: 'Operation timed out after 30001 milliseconds with 0 bytes received'. This may be a libcurl error, See https://curl.se/libcurl/c/libcurl-errors.html first for more details. ⏰ 09:17:53 [1/3] https://madouqu.com/?s=XXXX-2415 Error: Failed to perform, ErrCode: 28, Reason: 'Operation timed out after 30003 milliseconds with 0 bytes received'. This may be a libcurl error, See https://curl.se/libcurl/c/libcurl-errors.html first for more details. ⏰ 09:17:53 🔎 请求 https://madouqu.com/?s=DX-020 ⏰ 09:18:03 [1/3] https://madouqu.com/?s=XXXX-2415 Error: Failed to perform, ErrCode: 28, Reason: 'Operation timed out after 30001 milliseconds with 0 bytes received'. This may be a libcurl error, See https://curl.se/libcurl/c/libcurl-errors.html first for more details. ⏰ 09:18:03 🔎 请求 https://madouqu.com/?s=EMX-045 ⏰ 09:18:04 ⛔️ 正在停止正在运行的任务线程 (11) ... ⏰ 09:18:04 正在停止线程: 1/11 MDCx-Pool_0 ... ⏰ 09:18:04 正在停止线程: 2/11 MDCx-Pool_1 ... ⏰ 09:18:04 正在停止线程: 3/11 MDCx-Pool_2 ... ⏰ 09:18:04 正在停止线程: 4/11 MDCx-Pool_3 ... ⏰ 09:18:04 正在停止线程: 5/11 MDCx-Pool_4 ... ⏰ 09:18:04 正在停止线程: 6/11 MDCx-Pool_5 ... ⏰ 09:18:04 正在停止线程: 7/11 MDCx-Pool_6 ... ⏰ 09:18:04 正在停止线程: 8/11 MDCx-Pool_7 ... ⏰ 09:18:04 正在停止线程: 9/11 MDCx-Pool_8 ... ⏰ 09:18:04 正在停止线程: 10/11 MDCx-Pool_9 ... ⏰ 09:18:04 正在停止线程: 11/11 MDCx-Scrape-Thread ... ⏰ 09:18:04 线程正在停止中,请稍后... 🍯 停止时间与线程数量及线程正在执行的任务有关,比如正在执行网络请求、文件下载等IO操作时,需要等待其释放资源。。。

⏰ 09:18:33 所有线程已停止!!!(30s) ⛔️ 刮削已手动停止!

wuaishanshan commented 8 months ago

IMG20231231110235 中文名称刮削番号地址正确,数据显示也获取成功但就是不下载图片,这是什么问题呢

sqzw-x commented 8 months ago

以下网站使用该库:airav.py airav_cc.py avsex.py freejavbt.py iqqtv.py javdb.py javlibrary.py madouqu.py mmtv.py 以及获取亚马逊图片

检查你能否正常刮削以上网站,如果只有 madouqu 失败,说明是 mdcx 的问题;如果全部失败,说明是你的网络问题或者第三方库问题

中文名称刮削番号地址正确,数据显示也获取成功但就是不下载图片,这是什么问题呢

已修复,将随下个版本发布

sqzw-x commented 8 months ago

另外,你图片里这次刮削没有报网络错误,这说明 mdcx 成功请求了网站,因此你此前的网络请求失败大概率是当时的网络有问题

wuaishanshan commented 8 months ago

我也不清楚啥问题,我把clash的v6打开了就可以了,但是旧版本就不用打开也行,挺奇怪的