ytdl-org / youtube-dl

Command-line program to download videos from YouTube.com and other video sites
http://ytdl-org.github.io/youtube-dl/
The Unlicense
132.29k stars 10.03k forks source link

aljazeera not supported? #4487

Closed Fil closed 9 years ago

Fil commented 9 years ago

At https://github.com/rg3/youtube-dl/issues/3571#issuecomment-53172431 @phihag seems to imply that aljazeera links are supported, but I had no such luck on: http://www.aljazeera.com/programmes/witness/2014/08/miners-shot-down-2014812850358636.html nor on http://www.aljazeera.com/programmes/the-slum/2014/08/deliverance-201482883754237240.html

log:

$ youtube-dl http://www.aljazeera.com/programmes/the-slum/2014/08/deliverance-201482883754237240.html --verbose

[debug] System config: []
[debug] User config: []
[debug] Command-line args: ['http://www.aljazeera.com/programmes/the-slum/2014/08/deliverance-201482883754237240.html', '--verbose']
[debug] Encodings: locale UTF-8, fs utf-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2014.12.14
[debug] Python version 2.7.8 - Darwin-14.0.0-x86_64-i386-64bit
[debug] exe versions: ffmpeg 2.4.3, ffprobe 2.4.3, rtmpdump 2.4
[debug] Proxy map: {}
[generic] deliverance-201482883754237240: Requesting header
WARNING: Falling back on generic information extractor.
[generic] deliverance-201482883754237240: Downloading webpage
[generic] deliverance-201482883754237240: Extracting information
ERROR: Unsupported URL: http://www.aljazeera.com/programmes/the-slum/2014/08/deliverance-201482883754237240.html; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; type  youtube-dl -U  to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/generic.py", line 660, in _real_extract
    doc = parse_xml(webpage)
  File "/usr/local/bin/youtube-dl/youtube_dl/utils.py", line 1437, in parse_xml
    tree = xml.etree.ElementTree.XML(s.encode('utf-8'), **kwargs)
  File "/usr/local/Cellar/python/2.7.8_2/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1300, in XML
    parser.feed(text)
  File "/usr/local/Cellar/python/2.7.8_2/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1642, in feed
    self._raiseerror(v)
  File "/usr/local/Cellar/python/2.7.8_2/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1506, in _raiseerror
    raise err
ParseError: mismatched tag: line 67, column 55
Traceback (most recent call last):
  File "/usr/local/bin/youtube-dl/youtube_dl/YoutubeDL.py", line 555, in extract_info
    ie_result = ie.extract(url)
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/common.py", line 243, in extract
    return self._real_extract(url)
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/generic.py", line 1060, in _real_extract
    raise ExtractorError('Unsupported URL: %s' % url)
ExtractorError: Unsupported URL: http://www.aljazeera.com/programmes/the-slum/2014/08/deliverance-201482883754237240.html; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; type  youtube-dl -U  to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.

My IP does not appear to be blocked as I can watch the program directly from the website.

Fil commented 9 years ago

Let me add that if I take the "brightcove" URL from the Network panel of Firebug, youtube-dl properly fetches the video.

e.g.:

youtube-dl  "http://admin.brightcove.com/viewer/us20141117.1347/BrightcoveBootloader.swf?playerID=1422553060001&playerKey=AQ%7E%7E%2CAAAAmtVJIFk%7E%2CTVGOQ5ZTwJbeMWnq5d_H4MOM57xfzApc&purl=http%3A%2F%2Fwww.aljazeera.com%2Fprogrammes%2Fthe-slum%2F2014%2F08%2Fbreaking-out-201482891045531924.html&%40videoPlayer=3852255253001&ConvivaConfig.events=+%3CConvivaEventsMap%3E+%3Cevent+name%3D%22mediaPlay%22+module%3D%22videoPlayer%22+eventType%3D%22mediaPlay%22%3E+%3C%2Fevent%3E+%3Cevent+name%3D%22mediaError%22+module%3D%22videoPlayer%22+eventType%3D%22mediaError%22%3E+%3Cattr+key%3D%22errorCode%22+type%3D%22eventField%22+value%3D%22code%22%2F%3E+%3C%2Fevent%3E+%3C%2FConvivaEventsMap%3E+&adServerURL=http%3A%2F%2Fpubads.g.doubleclick.net%2Fgampad%2Fads%3Fsz%3D680x413%26iu%3D%2F5287%2Faljazeera_EN%2Fnews%26impl%3Ds%26gdfp_req%3D1%26env%3Dvp%26output%3Dxml_vast2%26unviewed_position_start%3D1%26url%3D%5Breferrer_url%5D%26correlator%3D%5Btimestamp%5D&autoStart=&bgcolor=%23FFFFFF&debuggerID=&dynamicStreaming=false&flashID=myExperience3852255253001&height=413&isUI=true&isVid=true&linkBaseURL=http%3A%2F%2Faje.me%2F1yCU3Mf&localizedErrorXML=http%3A%2F%2Fwebapps.aljazeera.net%2Fbrightcove%2Fplayer%2Fbc-georestrict-msgs.xml&startTime=1418715902586&videoSmoothing=false&width=680&wmode=transparent"
jaimeMF commented 9 years ago

Thanks for the report, it will work in the next version.

They probably embedded the brightcove player directly in the webpage, that's why youtube-dl used to recognized it (with the generic extractor). But they insert the player with javascript now.

phihag commented 9 years ago

Fixed in youtube-dl 20114.12.16. See our FAQ if you need help updating.

@Fil As you have seen, it is quite easy for us to overlook comments to issues, especially when they are closed. Please do open a new issue for a new issue. #3571 was about better brightcove detection, whereas this one was about detecting aljazeera specifically.

Fil commented 9 years ago

I took the chance of a simple comment, then came back a while later with an issue. Thank you for youtube-dl.