dhh1128 / intent

the intent formal language
https://intentlang.org
2 stars 1 forks source link

improve unicode support in intent #114

Open dhh1128 opened 9 years ago

dhh1128 commented 9 years ago

Although current functions behave as intended and desired, add additional functions to handle cesu-8 and modified utf8, as well as guarantee correct disallowing of overlong encodings and invalid sequences (which may already be correct; we just need to prove it). See http://j.mp/1GRqXbX and http://j.mp/1HzJPBY.

Need to provide variant of get_codepoint_from_utf8() that allows customizations of the substitution strategy:

May also want to link in libICU.