ravin353 / android-daisy-epub-reader

Automatically exported from code.google.com/p/android-daisy-epub-reader
0 stars 0 forks source link

Problems parsing files in ISO-8859-1 format with accented characters. #23

Closed GoogleCodeExporter closed 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Try to load a DAISY 2.02 book containing accented characters where the XML 
encoding is set to iso-8859-1 e.g. a Spanish book with a title of {{{<meta 
name=\"dc:title\" content=\"El último judío\" />}}}

What is the expected output? 
- The file should be correctly processed.

What do you see instead?
The sax parser throws an exception of the form: 
org.apache.harmony.xml.ExpatParser$ParseException: At line 4, column 34: not 
well-formed (invalid token)

I've written some test code to try and isolate the issue, and to see if I can 
add a workaround so users can listen to Spanish books :)

Links for more information:
http://en.wikipedia.org/wiki/8859-1 which seems to confirm the file is 
correctly formatted (as far as I can tell).

Original issue reported on code.google.com by julianharty on 17 Oct 2010 at 8:03

GoogleCodeExporter commented 8 years ago
Support added in r170 of the software.

Original comment by julianharty on 19 Oct 2010 at 9:08