CharsetDetector / UTF-unknown

Character set detector build in C# - .NET 5+, .NET Core 2+, .NET standard 1+ & .NET 4+
307 stars 46 forks source link

Port multi-byte character ratio detection in UTF-8 prober confidence function from jschardet #117

Closed yinyue200 closed 3 years ago

yinyue200 commented 3 years ago

fix #108

304NotModified commented 3 years ago

@rstm-sf do we think we should merge this one?

304NotModified commented 3 years ago

@yinyue200 do you think you could check the review comments? Or should be close this PR for now?