Closed RamonUnch closed 1 year ago
*SCSU still fails the test
your sample reveals a short point that SCSU that not handling code point that is smaller than U+0020. should be fixed by this commit: https://github.com/roytam1/rtoss/commit/ea2c8ad90aa5e50606b5b15d5e06e13c3527c457
and there is a glitch in rSCSU table that is fixed in upstream: https://github.com/roytam1/rtoss/commit/6f2475291d5e6cbeee93bd4a51854b8d932600b6
Nice findig indeed!
The character U+10000 must also be encoded as a a surrogate pair (D800 DC00) I used the attached file for testing (I just save in a given encoding, then re-open, then save back to utf8 and compare the files).
rUtf8/16/32(le/be) already OK
rBOCU already OK
rGB18030 already OK
rUtf1 FIXED!
rUtf9 FIXED:
rUtfOFSS Fixed!
*SCSU still fails the test
0-0x10ffff_UTF8.zip
@roytam1 you will be interested by this patch, maybe you will have an idea on how to fix SCSU read or write problem... I will have a look on my side...