CharsetDetector / UTF-unknown

Character set detector built in C# - .NET 5+, .NET Core 2+, .NET Standard 1+ & .NET 4+

UTF-8 file is detected as SBCSCodePageEncoding #168

Open Jujubeeeee opened 3 months ago

Jujubeeeee commented 3 months ago

Hi, I have a UTF-8 encoded txt file, but it is detected as SBCSCodePageEncoding. Thanks for your help!

Attached file: Brazil_Physical_Addresses_UTF8.txt