ytdl-org / youtube-dl

Command-line program to download videos from YouTube.com and other video sites
http://ytdl-org.github.io/youtube-dl/
The Unlicense

[MLB] New MLB.com video urls not supported #21241

Open lawlmart opened 5 years ago

lawlmart commented 5 years ago

Verbose log

$ youtube-dl https://www.mlb.com/video/must-c-belli-s-impressive-throw --verbose
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'https://www.mlb.com/video/must-c-belli-s-impressive-throw', u'--verbose']
[debug] Encodings: locale UTF-8, fs utf-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2019.05.20
[debug] Python version 2.7.10 (CPython) - Darwin-18.6.0-x86_64-i386-64bit
[debug] exe versions: ffmpeg 4.1.3, ffprobe 4.1.3, rtmpdump 2.4
[debug] Proxy map: {}
[generic] must-c-belli-s-impressive-throw: Requesting header
WARNING: Falling back on generic information extractor.
[generic] must-c-belli-s-impressive-throw: Downloading webpage
[generic] must-c-belli-s-impressive-throw: Extracting information
ERROR: Unsupported URL: https://www.mlb.com/video/must-c-belli-s-impressive-throw
Traceback (most recent call last):
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/generic.py", line 2340, in _real_extract
    doc = compat_etree_fromstring(webpage.encode('utf-8'))
  File "/usr/local/bin/youtube-dl/youtube_dl/compat.py", line 2551, in compat_etree_fromstring
    doc = _XML(text, parser=etree.XMLParser(target=_TreeBuilder(element_factory=_element_factory)))
  File "/usr/local/bin/youtube-dl/youtube_dl/compat.py", line 2540, in _XML
    parser.feed(text)
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1642, in feed
    self._raiseerror(v)
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1506, in _raiseerror
    raise err
ParseError: not well-formed (invalid token): line 110, column 83
Traceback (most recent call last):
  File "/usr/local/bin/youtube-dl/youtube_dl/YoutubeDL.py", line 796, in extract_info
    ie_result = ie.extract(url)
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/common.py", line 529, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/generic.py", line 3329, in _real_extract
    raise UnsupportedError(url)
UnsupportedError: Unsupported URL: https://www.mlb.com/video/must-c-belli-s-impressive-throw

Description

MLB has a new video setup and it doesn't appear to be supported for newer URLs. Here's another one: https://www.mlb.com/video/yonny-chirinos-in-play-out-s-to-rowdy-tellez
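To make the affected URL shapes concrete, here is a minimal sketch of a pattern that recognizes the page URLs reported in this thread. The pattern and the helper name `parse_mlb_url` are hypothetical and are not taken from youtube-dl; they only show what a dedicated extractor would have to match before it could look for the video.

```python
import re

# Hypothetical pattern, not part of youtube-dl. It covers only the URL
# shapes reported in this thread: /video/<slug> and /news/<slug>,
# optionally followed by a query string or fragment.
MLB_URL_RE = re.compile(
    r'https?://(?:www\.)?mlb\.com/(?P<kind>video|news)/(?P<slug>[^/?#]+)'
)


def parse_mlb_url(url):
    """Return (kind, slug) for a matching MLB page URL, else None."""
    m = MLB_URL_RE.match(url)
    if m is None:
        return None
    return m.group('kind'), m.group('slug')
```

Matching the URL is only the first step; locating the media behind the page is a separate problem that this sketch does not attempt.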

RingoTheDog commented 4 years ago

Bump.... and new example:
https://www.mlb.com/video/boone-on-implementing-mercy-rule?t=most-popular

youtube-dl.py -v "https://www.mlb.com/video/boone-on-implementing-mercy-rule?t=most-popular"
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: ['-v', 'https://www.mlb.com/video/boone-on-implementing-mercy-rule?t=most-popular']
[debug] Encodings: locale cp1252, fs mbcs, out cp850, pref cp1252
[debug] youtube-dl version 2019.08.13
[debug] Python version 3.5.1 (CPython) - Windows-7-6.1.7601-SP1
[debug] exe versions: ffmpeg N-71727-g46778ab, ffprobe 3.2, rtmpdump 2.4
[debug] Proxy map: {}
[generic] boone-on-implementing-mercy-rule?t=most-popular: Requesting header
WARNING: Falling back on generic information extractor.
[generic] boone-on-implementing-mercy-rule?t=most-popular: Downloading webpage
[generic] boone-on-implementing-mercy-rule?t=most-popular: Extracting information
ERROR: Unsupported URL: https://www.mlb.com/video/boone-on-implementing-mercy-rule?t=most-popular
Traceback (most recent call last):
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\YoutubeDL.py", line 796, in extract_info
    ie_result = ie.extract(url)
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\common.py", line 530, in extract
    ie_result = self._real_extract(url)
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\generic.py", line 3333, in _real_extract
    raise UnsupportedError(url)
youtube_dl.utils.UnsupportedError: Unsupported URL: https://www.mlb.com/video/boone-on-implementing-mercy-rule?t=most-popular

Thanks Ringo

RingoTheDog commented 4 years ago

Another example... youtube-dl falls back to the generic extractor on the MLB page and cannot find the video, but the direct link to the video downloads correctly:

Embedded URL does NOT work: https://www.mlb.com/video/player-spotlight-james-paxton

Direct link to video DOES work: https://cuts.diamond.mlb.com/FORGE/2019/2019-07/26/af1803aa-3c589170-5380437d-csvm-diamondx64-asset_1280x720_59_4000K.mp4

youtube-dl.py -v https://www.mlb.com/video/player-spotlight-james-paxton
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: ['-v', 'https://www.mlb.com/video/player-spotlight-james-paxton']
[debug] Encodings: locale cp1252, fs utf-8, out utf-8, pref cp1252
[debug] youtube-dl version 2019.08.13
[debug] Python version 3.6.3 (CPython) - Windows-7-6.1.7601-SP1
[debug] exe versions: ffmpeg N-71727-g46778ab, rtmpdump 2.4
[debug] Proxy map: {}
[generic] player-spotlight-james-paxton: Requesting header
WARNING: Falling back on generic information extractor.
[generic] player-spotlight-james-paxton: Downloading webpage
[generic] player-spotlight-james-paxton: Extracting information
ERROR: Unsupported URL: https://www.mlb.com/video/player-spotlight-james-paxton
Traceback (most recent call last):
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\YoutubeDL.py", line 796, in extract_info
    ie_result = ie.extract(url)
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\common.py", line 530, in extract
    ie_result = self._real_extract(url)
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\generic.py", line 3333, in _real_extract
    raise UnsupportedError(url)
youtube_dl.utils.UnsupportedError: Unsupported URL: https://www.mlb.com/video/player-spotlight-james-paxton

youtube-dl.py -v "https://cuts.diamond.mlb.com/FORGE/2019/2019-07/26/af1803aa-3c589170-5380437d-csvm-diamondx64-asset_1280x720_59_4000K.mp4"
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: ['-v', 'https://cuts.diamond.mlb.com/FORGE/2019/2019-07/26/af1803aa-3c589170-5380437d-csvm-diamondx64-asset_1280x720_59_4000K.mp4']
[debug] Encodings: locale cp1252, fs utf-8, out utf-8, pref cp1252
[debug] youtube-dl version 2019.08.13
[debug] Python version 3.6.3 (CPython) - Windows-7-6.1.7601-SP1
[debug] exe versions: ffmpeg N-71727-g46778ab, rtmpdump 2.4
[debug] Proxy map: {}
[generic] af1803aa-3c589170-5380437d-csvm-diamondx64-asset_1280x720_59_4000K: Requesting header
[debug] Default format spec: bestvideo+bestaudio/best
[debug] Invoking downloader on 'https://cuts.diamond.mlb.com/FORGE/2019/2019-07/26/af1803aa-3c589170-5380437d-csvm-diamondx64-asset_1280x720_59_4000K.mp4'
[download] Destination: af1803aa-3c589170-5380437d-csvm-diamondx64-asset_1280x720_59_4000K-af1803aa-3c589170-5380437d-csvm-diamondx64-asset_1280x720_59_4000K.mp4
[download] 100% of 106.16MiB in 01:12
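Since the direct .mp4 link downloads fine, one possible manual workaround is to search whatever text you have (saved page source, a network log, and so on) for links on the `cuts.diamond.mlb.com` host. The sketch below is only that: the helper name `find_direct_mp4_urls` is hypothetical, and it is an unverified assumption that the page source contains such a link at all.

```python
import re

# Hypothetical helper, not part of youtube-dl. The host and the .mp4
# suffix come from the one direct link reported in this thread; other
# videos may be served differently.
DIRECT_MP4_RE = re.compile(r'https?://cuts\.diamond\.mlb\.com/[\w./-]+?\.mp4')


def find_direct_mp4_urls(text):
    """Return unique direct .mp4 URLs found in text, in order of appearance."""
    found = []
    for url in DIRECT_MP4_RE.findall(text):
        if url not in found:
            found.append(url)
    return found
```

Any URL it returns could then be passed to youtube-dl directly, as in the log above. If the list comes back empty, the assumption did not hold for that page.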

RingoTheDog commented 3 years ago

A post pandemic example ;-)

URL: https://www.mlb.com/news/javier-baez-makes-slick-defensive-play-at-shortstop

falls back to generic and fails:

C:>youtube-dl.py -v "https://www.mlb.com/news/javier-baez-makes-slick-defensive-play-at-shortstop"
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: ['-v', 'https://www.mlb.com/news/javier-baez-makes-slick-defensive-play-at-shortstop']
[debug] Encodings: locale cp1252, fs utf-8, out utf-8, pref cp1252
[debug] youtube-dl version 2021.02.22
[debug] Python version 3.6.6 (CPython) - Windows-10-10.0.14393-SP0
[debug] exe versions: ffmpeg 4.3.2-2021-02-02-full_build-www.gyan.dev, ffprobe 4.3.2-2021-02-02-full_build-www.gyan.dev, rtmpdump 2.4
[debug] Proxy map: {}
[generic] javier-baez-makes-slick-defensive-play-at-shortstop: Requesting header
WARNING: Falling back on generic information extractor.
[generic] javier-baez-makes-slick-defensive-play-at-shortstop: Downloading webpage
[generic] javier-baez-makes-slick-defensive-play-at-shortstop: Extracting information
ERROR: Unsupported URL: https://www.mlb.com/news/javier-baez-makes-slick-defensive-play-at-shortstop
Traceback (most recent call last):
  File "C:\TRANSMOGRIFIER\youtube-dl.py\youtube_dl\YoutubeDL.py", line 806, in wrapper
    return func(self, *args, **kwargs)
  File "C:\TRANSMOGRIFIER\youtube-dl.py\youtube_dl\YoutubeDL.py", line 827, in __extract_info
    ie_result = ie.extract(url)
  File "C:\TRANSMOGRIFIER\youtube-dl.py\youtube_dl\extractor\common.py", line 532, in extract
    ie_result = self._real_extract(url)
  File "C:\TRANSMOGRIFIER\youtube-dl.py\youtube_dl\extractor\generic.py", line 3506, in _real_extract
    raise UnsupportedError(url)
youtube_dl.utils.UnsupportedError: Unsupported URL: https://www.mlb.com/news/javier-baez-makes-slick-defensive-play-at-shortstop