petea / sage

A feed reader for Firefox.
http://sagerss.com
27 stars 8 forks source link

XML Parse Error on international (e.g., accented) characters #75

Closed petea closed 11 years ago

petea commented 11 years ago

Original author: stephenc...@gmail.com (August 24, 2011 01:26:23)

What steps will reproduce the problem?

  1. Bookmark this feed in Sage: http://dirk.eddelbuettel.com/cranberries/index.rss
  2. At 8/23/2011 9:25pm Eastern Daylight Time, click on the link in the Sage pane.

What is the expected output? What do you see instead? I expect to see the CRANberries RSS feed content. Instead, I get an XML Parse Error on column 50 of this line: <strong>Author</strong>: Maxime Hervé <br>

As you can see, this line contains an accented "e" character in a person's name, and as a result, Sage barfs on the entire feed.

What version of Sage are you using? 1.4.12

What version of Firefox are you using? On what operating system? 6.0 for Ubuntu

Please provide any additional information below.

The same thing happens with ampersands that don't follow entity syntax; for instance, line 664 of this feed: Error: not well-formed Source File: http://rss.acm.org/dl/RM740.xml.

Original issue: http://code.google.com/p/sage/issues/detail?id=76

petea commented 11 years ago

This issue is a duplicate of #19