blackjack4494 / yt-dlc

media downloader and library for various sites.
The Unlicense

[Broken] Archive.org + playlists #269

Open blackjack4494 opened 3 years ago

blackjack4494 commented 3 years ago

Checklist

Verbose log

Playlist error output

[archive.org] WeihnachtenBeiHoppenstedts: Downloading webpage
ERROR: Unable to extract jwplayer playlist; please report this issue on https://github.com/blackjack4494/yt-dlc . Make sure you are using the latest version; see  https://github.com/blackjack4494/yt-dlc  on how to update. Be sure to call youtube-dlc with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "D:\gitkraken\yt-dlc\youtube_dlc\YoutubeDL.py", line 849, in wrapper
  File "D:\gitkraken\yt-dlc\youtube_dlc\YoutubeDL.py", line 870, in __extract_info
    ie_result = ie.extract(url)
  File "D:\gitkraken\yt-dlc\youtube_dlc\extractor\common.py", line 532, in extract
    ie_result = self._real_extract(url)
  File "D:\gitkraken\yt-dlc\youtube_dlc\extractor\archiveorg.py", line 45, in _real_extract
    webpage, 'jwplayer playlist'), video_id)
  File "D:\gitkraken\yt-dlc\youtube_dlc\extractor\common.py", line 1010, in _search_regex
    raise RegexNotFoundError('Unable to extract %s' % _name)
youtube_dlc.utils.RegexNotFoundError: Unable to extract jwplayer playlist; please report this issue on https://github.com/blackjack4494/yt-dlc . Make sure you are using the latest version; see  https://github.com/blackjack4494/yt-dlc  on how to update. Be sure to call youtube-dlc with the --verbose flag and include its complete output.

Specific video

[generic] Weihnachten+bei+Hoppenstedts: Requesting header
WARNING: Falling back on generic information extractor.
[generic] Weihnachten+bei+Hoppenstedts: Downloading webpage
Traceback (most recent call last):
  File "D:\gitkraken\yt-dlc\youtube_dlc\YoutubeDL.py", line 849, in wrapper
    return func(self, *args, **kwargs)
  File "D:\gitkraken\yt-dlc\youtube_dlc\YoutubeDL.py", line 870, in __extract_info
    ie_result = ie.extract(url)
  File "D:\gitkraken\yt-dlc\youtube_dlc\extractor\common.py", line 532, in extract
    ie_result = self._real_extract(url)
  File "D:\gitkraken\yt-dlc\youtube_dlc\extractor\generic.py", line 3400, in _real_extract
    raise UnsupportedError(url)
youtube_dlc.utils.UnsupportedError: Unsupported URL: https://archive.org/details/WeihnachtenBeiHoppenstedts/Weihnachten+bei+Hoppenstedts.mp4

Description

https://archive.org/details/WeihnachtenBeiHoppenstedts
or
https://archive.org/details/WeihnachtenBeiHoppenstedts/Weihnachten+bei+Hoppenstedts.mp4
will result in errors.

There have been some related issues: ytdl-org/youtube-dl#27236 ytdl-org/youtube-dl#27109 ytdl-org/youtube-dl#26780 ytdl-org/youtube-dl#25466 ytdl-org/youtube-dl#25277 ytdl-org/youtube-dl#23946 ytdl-org/youtube-dl#23700 ytdl-org/youtube-dl#23586 ytdl-org/youtube-dl#21330 ytdl-org/youtube-dl#7154

Some proposed patches: ytdl-org/youtube-dl#27156 ytdl-org/youtube-dl#23827

Forcing the generic extractor (--force-generic-extractor) won't work either.
I think this extractor deserves some attention, since archive.org is well known and hosts a lot of content. You can download everything fine when browsing the site, but some people may prefer a command-line tool or want to embed it into their (GUI) projects.
This extractor has had quite a lot of issues that were never addressed or were simply closed as duplicates.

october262 commented 3 years ago

This link works: https://archive.org/serve/WeihnachtenBeiHoppenstedts/Weihnachten%20bei%20Hoppenstedts.mp4 (to find it, open the browser's developer tools, select Network, then Media, and copy the first URL).

use the same thing with this link - https://archive.org/serve/WeihnachtenBeiHoppenstedts/Weihnachten%20bei%20Hoppenstedts.mp4
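
As a rough illustration of the workaround above, the rewrite from the failing /details/ URL to the working /serve/ URL could be scripted roughly like this. This is only a sketch: the function name details_to_serve_url is made up for the example, and it assumes that every item follows the same /details/<identifier>/<filename> to /serve/<identifier>/<filename> pattern seen in this one case, and that a "+" in the details URL always stands for a space. It is not how the archive.org extractor works.

```python
from urllib.parse import quote, unquote_plus, urlsplit


def details_to_serve_url(details_url):
    """Rewrite an archive.org /details/<id>/<file> URL to /serve/<id>/<file>.

    Hypothetical helper: the URL pattern is inferred from the single
    example in this thread and may not hold for every item.
    """
    parts = urlsplit(details_url)
    segments = [s for s in parts.path.split('/') if s]
    if len(segments) < 3 or segments[0] != 'details':
        raise ValueError(
            'expected /details/<identifier>/<filename>, got: %s' % details_url)
    identifier = segments[1]
    # Assumption: '+' encodes a space here, so decode it first and then
    # re-encode the filename with %20 as in the working link.
    filename = unquote_plus('/'.join(segments[2:]))
    return 'https://archive.org/serve/%s/%s' % (identifier, quote(filename))


if __name__ == '__main__':
    print(details_to_serve_url(
        'https://archive.org/details/WeihnachtenBeiHoppenstedts/'
        'Weihnachten+bei+Hoppenstedts.mp4'))
```

The resulting URL points straight at the media file, so it can be passed to youtube-dlc or any other downloader. A URL without a filename part (just /details/<identifier>) raises ValueError, because the file name cannot be guessed from the identifier alone.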

pukkandan commented 3 years ago

Related: https://github.com/ytdl-org/youtube-dl/pull/27156