PyYoshi / cChardet

universal character encoding detector
Other
383 stars 51 forks source link

Wrong detect ISO-8859-2 instead Windows-1250 #101

Open LessNick opened 1 week ago

LessNick commented 1 week ago

Hello there o/

I have text in Windows-1250 encoding, but the library stubbornly believes that this is ISO-8859-2 encoding and at the same time formats the unreadable character “Ś” as “\x8”.

Is it possible to somehow correct false positives?

Thank you very much in advance.

SLIMOK.TXT