HaveF / feedparser

Automatically exported from code.google.com/p/feedparser
Other
0 stars 0 forks source link

feedparser chokes on feeds with namespaced title tags #423

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Given a feed like:

...
    <item>
        <title>Ah!  Don&#8217;t shove blind people!</title>
        <link>http://saddlebaggins.wordpress.com/2014/03/03/ah-dont-shove-blind-people/</link>
        <pubDate>Mon, 03 Mar 2014 01:08:58 +0000</pubDate>
        <category><![CDATA[Uncategorized]]></category>
        <guid isPermaLink="false">http://saddlebaggins.wordpress.com/?p=639</guid>

.....

        <media:content url="http://www.visionaustralia.org/images/orientation-and-mobility/guiding_9_car.jpg?sfvrsn=0" medium="image">
            <media:title type="html">Hand on roof of car</media:title>
        </media:content>
    </item>

...

feedparser appears to ignore the media namespace and grabs the last title tag 
in *any* namespace. The parsed feed of this RSS Item is "Hand on roof of car" 
and not "Ah!  Don't shove blind people!"

Example data found @ http://saddlebaggins.wordpress.com/feed/

Thanks!

Original issue reported on code.google.com by Paul...@gmail.com on 11 Mar 2014 at 3:31

GoogleCodeExporter commented 9 years ago
Thanks for reporting this issue. This has been reported previously and 
unfortunately represents a larger issue with the media namespace support in 
feedparser due to its limited support of the namespace. I'm merging this with 
issue 257.

Original comment by kurtmckee on 10 Jul 2014 at 2:12