Open hpwamr opened 4 years ago
Like #16, the string to be detected is too short.
Note that the Windows-1258 issue does not occur with libchardet. This is due to differences in the Vietnamese language tables between libchardet and uchardet.
Hello. For the development of Notepad3, we use the UCHARDET charset detector.
In issue #1848 we ran into a problem: a single "UTF-8" character is detected by UCHARDET as Windows-1258 with a reliability level of 72%. 😕
Here it is the French "é" character (Précis:)!
In the following sample, it is the character "¶" that is wrongly detected and displayed as "ΒΆ".
Attached are the 2 samples: Error Detection Single UTF-8 (issue #1848).zip
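For illustration, here is a minimal Python sketch of why a lone non-ASCII UTF-8 character is ambiguous at the byte level. It does not call uchardet at all; it only uses Python's built-in codecs. The helper name `reinterpret` is hypothetical, and the choice of ISO-8859-7 as the Greek charset that would render "¶" as "ΒΆ" is an assumption, not something uchardet reported.

```python
# Hypothetical helper for illustration; not part of uchardet or Notepad3.
def reinterpret(text: str, other_encoding: str) -> str:
    """Encode text as UTF-8, then decode the same bytes with another charset."""
    return text.encode("utf-8").decode(other_encoding)

# Each of these characters is a two-byte sequence in UTF-8.
e_acute_bytes = "é".encode("utf-8")   # bytes C3 A9
pilcrow_bytes = "¶".encode("utf-8")   # bytes C2 B6

# Both byte pairs are also valid in single-byte code pages, so the
# bytes alone cannot rule those code pages out.
as_1258 = reinterpret("é", "cp1258")      # Windows-1258 (Vietnamese)
as_greek = reinterpret("¶", "iso8859_7")  # assumed Greek charset

print(e_acute_bytes.hex(), repr(as_1258))
print(pilcrow_bytes.hex(), repr(as_greek))
```

With only two non-ASCII bytes in the whole file, the detector has very little statistical evidence to prefer UTF-8 over a single-byte code page in which the same bytes are legal.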
Thanks in advance for your attention. Have a nice day. hpwamr
Feel free to test the BETA version "Notepad3Portable_5.20.116.2708_BETA.paf.exe.7z" or higher. See "Notepad3 BETA-channel access #1129" or here Notepad3Portable_5.20.116.2708_BETA.paf.exe.7z.
Note: "Notepad3Portable BETA" can be used in "2 flavors" (with or without the extension ".7z").
Your comments and suggestions are always welcome... 😃