pombreda / feedparser

Automatically exported from code.google.com/p/feedparser

utf-16 detected as windows-1252 or None #180

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
% python
>>> import chardet
>>> chardet.detect(unicode('Hello World').encode('utf-16'))
{'confidence': 0.0, 'encoding': None}
>>> chardet.detect(unicode('hello world').encode('utf-16'))
{'confidence': 0.5, 'encoding': 'windows-1252'}

What is the expected output? What do you see instead?

I expect to see {'confidence': something_near_one, 'encoding': 'utf-16'}
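
As a side note on the expected result: Python's 'utf-16' codec writes a byte order mark (BOM) at the start of its output, so data produced as in the steps above can be recognized without any statistical analysis. The following is only a minimal sketch of that idea; sniff_bom is a hypothetical helper name, not part of chardet or feedparser, and it says nothing about how either library actually works internally.

```python
import codecs


def sniff_bom(data):
    """Hypothetical helper: guess a codec name from a leading BOM.

    Returns 'utf-32', 'utf-16', or None when no BOM is present.
    """
    # Check UTF-32 first: the UTF-32 LE BOM (FF FE 00 00) begins with
    # the same two bytes as the UTF-16 LE BOM (FF FE).
    if data.startswith((codecs.BOM_UTF32_LE, codecs.BOM_UTF32_BE)):
        return 'utf-32'
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return 'utf-16'
    return None


sample = u'Hello World'.encode('utf-16')
print(sniff_bom(sample))        # codec name guessed from the BOM
print(sample.decode('utf-16'))  # round-trips back to the original text
```

A BOM check like this only helps when the BOM is actually present; UTF-16 data written without one would still need some other form of detection.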

What version of the product are you using? On what operating system?
1.0
OS X 10.5.8
Installed with MacPorts: port install py25-chardet

Original issue reported on code.google.com by jesse.ro...@gmail.com on 17 Aug 2009 at 3:30

GoogleCodeExporter commented 9 years ago
Please close this bug as invalid.

This is a chardet issue, although the reporter's document is so small that I'm
not sure the result is statistically significant.
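
To give a sense of how little data the detector receives in the reported case, here is a small sketch that inspects the encoded sample. It only measures the input; it makes no claim about why chardet returns the results shown in the report.

```python
# Encode the reporter's sample the same way as in the report.
sample = u'Hello World'.encode('utf-16')

# 2 bytes of BOM plus 2 bytes for each of the 11 characters.
total_bytes = len(sample)

# Each ASCII character contributes one zero byte in UTF-16.
nul_bytes = sample.count(b'\x00')

print(total_bytes)  # 24
print(nul_bytes)    # 11
```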

Original comment by kurtmckee on 4 Dec 2010 at 3:27

GoogleCodeExporter commented 9 years ago
Please file this here: http://code.google.com/p/chardet/issues/list

Original comment by adewale on 4 Dec 2010 at 10:45