CharsetDetector / UTF-unknown

Character set detector build in C# - .NET 5+, .NET Core 2+, .NET standard 1+ & .NET 4+
303 stars 45 forks source link

Fix false negative with HZ detection #129

Closed jairbubbles closed 2 years ago

jairbubbles commented 2 years ago

I'm not too familiar with HZ sequence but it seems like a very corner case and it prevents some files to be decteded as ASCII. I'm proposing a very WIP fix, let me know your thoughts!

304NotModified commented 2 years ago

looks good to me. What do you think @rstm-sf?

jairbubbles commented 2 years ago

Let me know if I need to make other changes.