rhdunn / cainteoir-engine

The Cainteoir Text-to-Speech core engine
http://reecedunn.co.uk/cainteoir/
GNU General Public License v3.0
43 stars 8 forks source link

make the XML reader use the character encoding API #12

Closed rhdunn closed 12 years ago

rhdunn commented 12 years ago

The current XML reader does not support character encodings. It should use the character encoding API and switch encoding on:

  1. Byte-Order Mark (BOM) for utf-8, utf-16 le/be and utf-32 le/be encodings;
  2. The encoding property for the <?xml?> processing instruction.

This should also support a mechanism to switch encodings to allow the html processing to handle content-type meta tags.

It should also be able to take a default encoding parameter to handle content-type encodings in mime headers,

rhdunn commented 12 years ago

This is now done (except for the BOM support)