libo26 / feedparser

Automatically exported from code.google.com/p/feedparser
Other
0 stars 0 forks source link

Images with title attributes are truncated at " #117

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Check out the dinosaur comics feed: http://www.rsspect.com/rss/qwantz.xml
2. look for a comic with a long title attribute (aka funny mouse-over text)

What is the expected output? What do you see instead?
I should see all of the html (which is encoded as html of course),
including the title, like:
"<img src="http://www.qwantz.com/comics/comic2-1248.png" width="735"
height="500" title="the eponymous juliet is 13 in &quot;romeo and
juliet&quot;! did you know that? kinda weird, shakespeare" border="0">

instead it gets chopped off when it sees "&amp;quot;":
<img src="http://www.qwantz.com/comics/comic2-1248.png" width="735"
height="500" title="the eponymous juliet is 13 in " />

What version of the product are you using? On what operating system?
4.1, ubuntu hardy

Please provide any additional information below.
I would be happy to fix this bug myself, I just need some guidance about
where this type of processing is done in the code.

Original issue reported on code.google.com by yww...@gmail.com on 14 May 2008 at 3:21

GoogleCodeExporter commented 9 years ago
Please close this bug as fixed.

I tested using svn trunk and the URL provided, and found that a recent comic 
[1] containing quotes in the img title attribute was escaped as expected.

[1] http://www.qwantz.com/index.php?comic=1848

Original comment by kurtmckee on 3 Dec 2010 at 10:11

GoogleCodeExporter commented 9 years ago

Original comment by adewale on 4 Dec 2010 at 10:27