dimones / feedparser

Automatically exported from code.google.com/p/feedparser
Other
0 stars 0 forks source link

Content inside HTML-escaped CDATA Ignored #413

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Try and parse the attached RSS sample.

   >>> d = feedparser.parse('bushfireAlert.xml')

2. The item description only contains "MORE!", all the other content is missing.

    >>> d['entries'][0]['description']
    u'MORE!'

What is the expected output? What do you see instead?

Expect:

Alert Level: Advice.<br />Location: Herberton Petford Rd, 
Irvinebank.<br />Reported: 12/09/2013 1:17:49 PM.<br />Current 
Status: Going.<br />Details: A fire or other emergency has started in the 
area however there is no immediate threat. MORE!

Result:

MORE!

What version of the product are you using? On what operating system?

5.1.3, Mac OSX 10.8, Python 2.7.2.

Please provide any additional information below.

This sample it taken from the feed:

http://www.ruralfire.qld.gov.au/bushfirealert/bushfireAlert.xml

Original issue reported on code.google.com by everbl...@gmail.com on 18 Sep 2013 at 2:44

Attachments:

GoogleCodeExporter commented 9 years ago
Thanks for reporting this. I'll review this as I have time and see what can be 
done.

Original comment by kurtmckee on 10 Jul 2014 at 4:43