eight04 / ComicCrawler

An image crawler written in Python.

instagram download fails #283

Closed bluelovers closed 5 months ago

bluelovers commented 3 years ago
requests.exceptions.HTTPError: 429 Client Error: - for url: https://www.instagram.com/p/BKeRcvsA-5S/
Start download [instagram] ALEX JOHNSTON 🌶(@alexspicyboy)• Instagram 相片與影片
Start downloading [instagram] ALEX JOHNSTON 🌶(@alexspicyboy)• Instagram 相片與影片
total 189 episode.
Downloading ep BLkPFi9jl-A
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 57, in init_images
    self.get_html()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 175, in get_html
    self.html = self.downloader.html(self.ep.current_url)
  File "c:\python37-32\lib\site-packages\comiccrawler\module_grabber.py", line 17, in html
    **kwargs
  File "c:\python37-32\lib\site-packages\comiccrawler\grabber.py", line 151, in grabhtml
    r = grabber(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\comiccrawler\grabber.py", line 105, in grabber
    r = await_(do_request, s, url, proxies, retry, **kwargs)
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 905, in wrapped
    return f(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 927, in await_
    return async_(callback, *args, **kwargs).get()
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 682, in get
    raise err
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\comiccrawler\grabber.py", line 131, in do_request
    r.raise_for_status()
  File "c:\python37-32\lib\site-packages\requests\models.py", line 940, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 429 Client Error: - for url: https://www.instagram.com/p/BLkPFi9jl-A/
Thread crashed: <function DownloadManager.start_download.<locals>.do_download at 0x0195A198>
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\comiccrawler\download_manager.py", line 127, in do_download
    download(mission, profile(mission.module.config["savepath"]))
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 226, in download
    crawl(mission, savepath)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 264, in crawl
    crawlpage(crawler)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 329, in crawlpage
    error_loop(download, download_error)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 343, in error_loop
    handle_error(er)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 325, in download_error
    raise er
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 57, in init_images
    self.get_html()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 175, in get_html
    self.html = self.downloader.html(self.ep.current_url)
  File "c:\python37-32\lib\site-packages\comiccrawler\module_grabber.py", line 17, in html
    **kwargs
  File "c:\python37-32\lib\site-packages\comiccrawler\grabber.py", line 151, in grabhtml
    r = grabber(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\comiccrawler\grabber.py", line 105, in grabber
    r = await_(do_request, s, url, proxies, retry, **kwargs)
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 905, in wrapped
    return f(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 927, in await_
    return async_(callback, *args, **kwargs).get()
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 682, in get
    raise err
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\comiccrawler\grabber.py", line 131, in do_request
    r.raise_for_status()
  File "c:\python37-32\lib\site-packages\requests\models.py", line 940, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 429 Client Error: - for url: https://www.instagram.com/p/BLkPFi9jl-A/
Start download [instagram] 류원 Ryuwon(@rw2100)• Instagram 相片與影片
Start downloading [instagram] 류원 Ryuwon(@rw2100)• Instagram 相片與影片
total 72 episode.
Downloading ep BS79ztvAV-Z
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 57, in init_images
    self.get_html()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 175, in get_html
    self.html = self.downloader.html(self.ep.current_url)
  File "c:\python37-32\lib\site-packages\comiccrawler\module_grabber.py", line 17, in html
    **kwargs
  File "c:\python37-32\lib\site-packages\comiccrawler\grabber.py", line 151, in grabhtml
    r = grabber(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\comiccrawler\grabber.py", line 105, in grabber
    r = await_(do_request, s, url, proxies, retry, **kwargs)
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 905, in wrapped
    return f(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 927, in await_
    return async_(callback, *args, **kwargs).get()
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 682, in get
    raise err
  File "c:\python37-32\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python37-32\lib\site-packages\comiccrawler\grabber.py", line 131, in do_request
    r.raise_for_status()
  File "c:\python37-32\lib\site-packages\requests\models.py", line 940, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 429 Client Error: - for url: https://www.instagram.com/p/BS79ztvAV-Z/
Stop downloading
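
The tracebacks above all end in `r.raise_for_status()` raising on HTTP 429 (Too Many Requests), which means the server is rate limiting the client. A common way to cope is to wait before retrying, honoring the `Retry-After` header when the server sends one. The following is only a minimal sketch of that idea and not ComicCrawler's actual retry logic: `backoff_delay`, `fetch_with_backoff`, and the `fetch` callable returning `(status, headers, body)` are hypothetical names chosen for illustration.

```python
import time


def backoff_delay(attempt, retry_after=None, base=2.0, cap=300.0):
    """Return the seconds to wait before retry number `attempt` (0-based).

    A numeric Retry-After value wins when present; otherwise the delay
    doubles on every attempt and is capped at `cap` seconds.
    """
    if retry_after is not None:
        try:
            return min(float(retry_after), cap)
        except ValueError:
            pass  # Retry-After can also be an HTTP date; ignored in this sketch
    return min(base * (2 ** attempt), cap)


def fetch_with_backoff(fetch, url, max_tries=5, sleep=time.sleep):
    """Call the hypothetical fetch(url) -> (status, headers, body), retrying on 429."""
    for attempt in range(max_tries):
        status, headers, body = fetch(url)
        if status != 429:
            return status, body
        sleep(backoff_delay(attempt, headers.get("Retry-After")))
    raise RuntimeError(f"still rate limited after {max_tries} tries: {url}")
```

Passing `sleep` as a parameter keeps the sketch testable without real waiting. Whether waiting is enough for Instagram in particular is not something this thread establishes.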
bluelovers commented 3 years ago

KeyError: 'PostPage'
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Something bad happened, skip the episode.
Downloading ep B_nil7dJOdl
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Something bad happened, skip the episode.
Downloading ep CABW2-IpEQS
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Something bad happened, skip the episode.
Downloading ep CA7XSw6pkdJ
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Something bad happened, skip the episode.
Downloading ep CBtf9tFptlA
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Something bad happened, skip the episode.
Downloading ep CByoFp6pEOE
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
No batch analyzer exists
Traceback (most recent call last):
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 336, in error_loop
    process()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 287, in download
    crawler.init()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python37-32\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    self.ep.current_url
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 72, in get_images
    media = get_init_data(html, "PostPage")["shortcode_media"]
  File "c:\python37-32\lib\site-packages\comiccrawler\mods\instagram.py", line 69, in get_init_data
    return shared_data["entry_data"][page][0]["graphql"]
KeyError: 'PostPage'
Stop downloading
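
The `KeyError: 'PostPage'` above comes from the chained lookup `shared_data["entry_data"][page][0]["graphql"]` in `get_init_data`: the page that came back did not contain a `PostPage` entry. A sketch of a more defensive lookup is shown below. It reports which entries were present instead of failing with a bare `KeyError`. The function name `get_graphql`, the exception `PageDataMissing`, and the sample key `LoginAndSignupPage` are assumptions made for illustration and are not taken from ComicCrawler's source.

```python
class PageDataMissing(Exception):
    """Raised when the expected page entry is absent from the shared data."""


def get_graphql(shared_data, page):
    """Return shared_data["entry_data"][page][0]["graphql"] with a clearer error."""
    entry_data = shared_data.get("entry_data", {})
    pages = entry_data.get(page)
    if not pages:
        # List the keys that did arrive; they hint at what was served instead.
        found = ", ".join(sorted(entry_data)) or "none"
        raise PageDataMissing(
            f"entry_data has no {page!r} entry (found: {found})"
        )
    return pages[0]["graphql"]


# Hypothetical sample data shaped like the lookup in the traceback.
sample = {"entry_data": {"PostPage": [{"graphql": {"shortcode_media": {"id": "1"}}}]}}
media = get_graphql(sample, "PostPage")["shortcode_media"]
```

With an error message that names the keys actually present, a log like the one above would show what kind of page was returned rather than only the missing key.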
eight04 commented 3 years ago

The error message doesn't match the current source code. Are you using the latest version?

bluelovers commented 3 years ago
Start download [instagram] HIKO(@hiko_wan)• Instagram 相片與影片
Start downloading [instagram] HIKO(@hiko_wan)• Instagram 相片與影片
total 1860 episode.
Downloading ep wv0WzyDYsD
Traceback (most recent call last):
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 57, in init_images
    self.get_html()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 177, in get_html
    self.html = self.downloader.html(self.ep.current_url, referer=self.mission.url)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 11, in html
    return self.grab(grabhtml, url, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 17, in grab
    return grabber(
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 151, in grabhtml
    r = grabber(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 105, in grabber
    r = await_(do_request, s, url, proxies, retry, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 905, in wrapped
    return f(*args, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 927, in await_
    return async_(callback, *args, **kwargs).get()
  File "c:\python38\lib\site-packages\worker\__init__.py", line 682, in get
    raise err
  File "c:\python38\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 131, in do_request
    r.raise_for_status()
  File "c:\python38\lib\site-packages\requests\models.py", line 941, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 429 Client Error: - for url: https://www.instagram.com/challenge/?next=/p/wv0WzyDYsD/
Thread crashed: <function DownloadManager.start_download.<locals>.do_download at 0x00000255304F7E50>
Traceback (most recent call last):
  File "c:\python38\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\download_manager.py", line 127, in do_download
    download(mission, profile(mission.module.config["savepath"]))
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 228, in download
    crawl(mission, savepath)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 266, in crawl
    crawlpage(crawler)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 331, in crawlpage
    error_loop(download, download_error)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 345, in error_loop
    handle_error(er)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 327, in download_error
    raise er
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 57, in init_images
    self.get_html()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 177, in get_html
    self.html = self.downloader.html(self.ep.current_url, referer=self.mission.url)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 11, in html
    return self.grab(grabhtml, url, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 17, in grab
    return grabber(
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 151, in grabhtml
    r = grabber(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 105, in grabber
    r = await_(do_request, s, url, proxies, retry, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 905, in wrapped
    return f(*args, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 927, in await_
    return async_(callback, *args, **kwargs).get()
  File "c:\python38\lib\site-packages\worker\__init__.py", line 682, in get
    raise err
  File "c:\python38\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 131, in do_request
    r.raise_for_status()
  File "c:\python38\lib\site-packages\requests\models.py", line 941, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 429 Client Error: - for url: https://www.instagram.com/challenge/?next=/p/wv0WzyDYsD/
Start download [instagram] Instagram 上的 Tinneh || cosplay(@tibytinneh) • 927 張/段相片和影片
Start downloading [instagram] Instagram 上的 Tinneh || cosplay(@tibytinneh) • 927 張/段相片和影片
total 927 episode.
Downloading ep k4-eGCI55a
Traceback (most recent call last):
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 57, in init_images
    self.get_html()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 177, in get_html
    self.html = self.downloader.html(self.ep.current_url, referer=self.mission.url)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 11, in html
    return self.grab(grabhtml, url, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 17, in grab
    return grabber(
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 151, in grabhtml
    r = grabber(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 105, in grabber
    r = await_(do_request, s, url, proxies, retry, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 905, in wrapped
    return f(*args, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 927, in await_
    return async_(callback, *args, **kwargs).get()
  File "c:\python38\lib\site-packages\worker\__init__.py", line 682, in get
    raise err
  File "c:\python38\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 131, in do_request
    r.raise_for_status()
  File "c:\python38\lib\site-packages\requests\models.py", line 941, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 429 Client Error: - for url: https://www.instagram.com/challenge/?next=/p/k4-eGCI55a/
Session saved
Thread crashed: <function DownloadManager.start_download.<locals>.do_download at 0x0000025530900280>
Traceback (most recent call last):
  File "c:\python38\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\download_manager.py", line 127, in do_download
    download(mission, profile(mission.module.config["savepath"]))
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 228, in download
    crawl(mission, savepath)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 266, in crawl
    crawlpage(crawler)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 331, in crawlpage
    error_loop(download, download_error)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 345, in error_loop
    handle_error(er)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 327, in download_error
    raise er
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 57, in init_images
    self.get_html()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 177, in get_html
    self.html = self.downloader.html(self.ep.current_url, referer=self.mission.url)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 11, in html
    return self.grab(grabhtml, url, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 17, in grab
    return grabber(
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 151, in grabhtml
    r = grabber(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 105, in grabber
    r = await_(do_request, s, url, proxies, retry, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 905, in wrapped
    return f(*args, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 927, in await_
    return async_(callback, *args, **kwargs).get()
  File "c:\python38\lib\site-packages\worker\__init__.py", line 682, in get
    raise err
  File "c:\python38\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 131, in do_request
    r.raise_for_status()
  File "c:\python38\lib\site-packages\requests\models.py", line 941, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 429 Client Error: - for url: https://www.instagram.com/challenge/?next=/p/k4-eGCI55a/
Start download [instagram] 宮本彩希(@sakimi_cos)• Instagram 相片與影片
Start downloading [instagram] 宮本彩希(@sakimi_cos)• Instagram 相片與影片
total 8 episode.
Downloading ep B89cWXtnvEa
Traceback (most recent call last):
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 57, in init_images
    self.get_html()
  File "c:\python38\lib\site-packages\comiccrawler\crawler.py", line 177, in get_html
    self.html = self.downloader.html(self.ep.current_url, referer=self.mission.url)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 11, in html
    return self.grab(grabhtml, url, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\module_grabber.py", line 17, in grab
    return grabber(
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 151, in grabhtml
    r = grabber(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 105, in grabber
    r = await_(do_request, s, url, proxies, retry, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 905, in wrapped
    return f(*args, **kwargs)
  File "c:\python38\lib\site-packages\worker\__init__.py", line 927, in await_
    return async_(callback, *args, **kwargs).get()
  File "c:\python38\lib\site-packages\worker\__init__.py", line 682, in get
    raise err
  File "c:\python38\lib\site-packages\worker\__init__.py", line 474, in wrap_worker
    self.ret = self.worker(*args, **kwargs)
  File "c:\python38\lib\site-packages\comiccrawler\grabber.py", line 131, in do_request
    r.raise_for_status()
  File "c:\python38\lib\site-packages\requests\models.py", line 941, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 429 Client Error: - for url: https://www.instagram.com/challenge/?next=/p/B89cWXtnvEa/
Stop downloading
停止下載
bluelovers commented 3 years ago

Logged in with my own account:

https://www.instagram.com/elfin_cya/ => https://www.instagram.com/challenge/?next=/elfin_cya/

image

In incognito mode:

https://www.instagram.com/elfin_cya/

image

bluelovers commented 3 years ago

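I think the infinite loop in the mission mechanism (a fails -> b fails -> c fails -> back to a for another try) causes the account to be flagged as a bot, which then triggers the site's protection mechanism.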

https://github.com/eight04/ComicCrawler/issues/250

eight04 commented 3 years ago

causes the account to be flagged as a bot, which then triggers the site's protection mechanism

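Which came first: the download failure, or the account being flagged as a bot? If the account is flagged as a bot, failed downloads are expected. What happens if you switch to an account that can still browse normally?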


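The code shown in https://github.com/eight04/ComicCrawler/issues/283#issuecomment-707339267 differs from the latest version. Are you using the latest version? Run

comiccrawler --version

to see which version of comiccrawler is actually being executed.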

bluelovers commented 3 years ago

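I think the failure came first, so the account was flagged as a bot, and after that everything failed. After all, I had never logged in to IG with anything else before.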

version 2020.10.29

eight04 commented 3 years ago

version 2020.10.29

Then your system may have two or more versions of Comic Crawler or Python installed. I suggest first checking whether more than one Python is installed:

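  1. Check the list of installed applications for more than one Python.
  2. Run where python in cmd. It lists the paths of the python executables; confirm that the path matches the one you chose when you installed Python.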

Finally, uninstall Comic Crawler and install it again:

pip uninstall comiccrawler
pip install comiccrawler
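
The same check can also be done from inside Python. This is only a minimal sketch, not part of Comic Crawler: `describe_install` is a hypothetical helper name, and it assumes Python 3.8 or later (for `importlib.metadata`). Run it with the same interpreter that runs Comic Crawler and compare the reported paths with the paths in the traceback.

```python
import sys
from importlib import metadata, util


def describe_install(package="comiccrawler"):
    """Report which interpreter is running and where `package` is loaded from.

    Hypothetical helper for illustration; returns None for fields that
    cannot be determined (for example when the package is not installed).
    """
    spec = util.find_spec(package)
    try:
        version = metadata.version(package)
    except metadata.PackageNotFoundError:
        version = None
    return {
        "python": sys.executable,  # interpreter actually in use
        "package_path": spec.origin if spec else None,  # file the import resolves to
        "version": version,  # version recorded by pip, if any
    }


if __name__ == "__main__":
    for key, value in describe_install().items():
        print(f"{key}: {value}")
```

If the printed package path points into a different Python directory than the one in the traceback, more than one installation is probably still present.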

I think the failure came first, so the account was flagged as a bot, and after that everything failed

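If the failure came first, you need to test with an account that has not been banned in order to capture the original error. 429 Client Error (HTTP 429 Too Many Requests) is the error you get once you are banned; when you see it, you can stop downloading. I don't know how long IG bans an account/IP. After a while, check in a browser whether you can browse normally, and if you can, try downloading with Comic Crawler again.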
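
As a rough illustration of treating 429 as a stop signal instead of retrying in a loop, here is a minimal sketch. It is not how Comic Crawler is implemented: `fetch_with_limit`, `RateLimited`, and the `fetch` callable (assumed to return a `(status, body)` tuple) are hypothetical names introduced for this example.

```python
import time


class RateLimited(Exception):
    """Raised when the server answers HTTP 429 (Too Many Requests)."""


def fetch_with_limit(fetch, url, max_retries=3, base_delay=1.0, sleep=time.sleep):
    """Call fetch(url) -> (status, body) with bounded retries.

    Other HTTP errors are retried with exponential backoff, but a 429
    aborts immediately so the caller can pause the whole download.
    """
    for attempt in range(max_retries + 1):
        status, body = fetch(url)
        if status == 429:
            # Retrying here would only send more requests to a server
            # that is already rejecting them.
            raise RateLimited(url)
        if status < 400:
            return body
        if attempt < max_retries:
            sleep(base_delay * 2 ** attempt)  # 1s, 2s, 4s, ...
    raise RuntimeError(f"giving up on {url} after {max_retries + 1} attempts")
```

A caller would catch `RateLimited` once, stop all pending episodes, and only resume after confirming in a browser that the site is reachable again.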

bluelovers commented 3 years ago

I removed every Python version I could find on the computer and rebooted.

The IG content is also visible in a browser:

image

But with Comic Crawler, the following errors appear:

Downloading ep B9LhpSwHtZb
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Something bad happened, skip the episode.
Downloading ep B9MEUYAn-9t
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Stop downloading
bluelovers commented 3 years ago
Downloading ep B-v_6YoJ6LY
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Something bad happened, skip the episode.
Downloading ep B-03J_KBd_I
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Something bad happened, skip the episode.
Downloading ep B-_yy4-B9ZE
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Traceback (most recent call last):
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 338, in error_loop
    process()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 289, in download
    crawler.init()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 51, in init
    self.init_images(self.ep.current_page - 1)
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 58, in init_images
    self.get_images()
  File "c:\python39\lib\site-packages\comiccrawler\crawler.py", line 186, in get_images
    images = self.mod.get_images(
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 80, in get_images
    media = get_extra_data(html)["shortcode_media"]
  File "c:\python39\lib\site-packages\comiccrawler\mods\instagram.py", line 75, in get_extra_data
    text = re.search("window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>", html).group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Stop downloading
停止下載
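
The AttributeError in the tracebacks above comes from calling `.group(1)` on the result of `re.search`, which returns None when the page does not contain the expected script (for example, if a login or challenge page was served instead of the post). The following is a minimal sketch of a guard, not the module's actual fix: `extract_additional_data_text` and `PageFormatError` are hypothetical names, and only the regular expression is taken from the traceback.

```python
import re

# Same pattern as in the traceback, written as a raw string so the
# backslash escapes are passed to the regex engine unchanged.
ADDITIONAL_DATA_RE = re.compile(
    r"window\.__additionalDataLoaded\('[^']+',(.*?)\);</script>"
)


class PageFormatError(Exception):
    """Raised when the page lacks the expected embedded data."""


def extract_additional_data_text(html):
    """Return the raw text passed to window.__additionalDataLoaded.

    Raises PageFormatError instead of AttributeError when the script is
    missing, so the caller can report what actually went wrong.
    """
    match = ADDITIONAL_DATA_RE.search(html)
    if match is None:
        raise PageFormatError(
            "window.__additionalDataLoaded not found; "
            "the server may have returned a login or challenge page"
        )
    return match.group(1)
```

With a guard like this, the log would show a descriptive error once, which makes it easier to tell a changed page layout apart from a blocked account.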
eight04 commented 3 years ago

I tested https://www.instagram.com/sakimi_cos/ and it works normally.