readium / readium-sdk

A C++ ePub renderer SDK
BSD 3-Clause "New" or "Revised" License
390 stars 163 forks source link

Android - UTF-8 vs modified UTF-8 vs UTF-16 #174

Open mark-buer opened 9 years ago

mark-buer commented 9 years ago

I understand that the Android JNI bindings are still WIP, but this seems an important issue to raise so it can be addressed before a stable Android release.

On the JNI API, the *UTFString* API functions speak a funny dialect of UTF-8 called "Modified UTF-8".

Do the Android Readium bindings wrongly assume that the *UTFString* functions speak proper UTF-8?

The following hints at a potential problem:

Most JNI projects I've seen use the UTF-16 variants of the JNI API. These are the *String* API functions (sans UTF).

rkwright commented 9 years ago

@mark-buer : Unfortunately, the developers who created the JNI bindings have moved on to other projects and are no longer around to answer these questions. I think you clearly have more knowledge of this area than I. I will put this on the agenda for our Wednesday meeting at 1600h UTC. You are more than welcome to attend. Feel free to contact me directly at rkwright@readium.org. Thanks