Open GoogleCodeExporter opened 9 years ago
langdetect is a language detection library for sufficiently long text and
performs poorly on short text. In particular, single-word detection is almost
always incorrect with our approach.
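To illustrate why, here is a rough sketch (not langdetect's actual code). langdetect works from character n-gram statistics, so a single word gives it very little evidence. The function name `char_ngrams`, the space padding, and the sample strings are assumptions made up for this illustration.

```python
from collections import Counter

def char_ngrams(text, max_n=3):
    """Count character n-grams of length 1..max_n, padding each word with spaces.

    Hypothetical helper for illustration only; langdetect's real feature
    extraction differs in its details.
    """
    counts = Counter()
    for word in text.lower().split():
        padded = f" {word} "
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                gram = padded[i:i + n]
                if gram.strip():  # skip grams made only of padding
                    counts[gram] += 1
    return counts

one_word = char_ngrams("chat")
sentence = char_ngrams("le chat dort sur le canapé près de la fenêtre")

# Compare how much evidence each input provides.
print(sum(one_word.values()), sum(sentence.values()))
```

A single word yields only a handful of n-grams, and many of them are shared across several languages, so the estimate is unreliable.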
I don't know how Polyglot3000 recognizes languages, but I guess it has very
large dictionaries that can't be held in memory. That is not my approach.
I'm researching short text detection in parallel, but that probably can't
detect the language of a single word either...
The distribution page for the Wikipedia abstracts is listed here:
http://code.google.com/p/language-detection/wiki/Tools
Original comment by nakatani.shuyo
on 25 Jan 2012 at 2:57
Original issue reported on code.google.com by
Preload...@gmail.com
on 21 Jan 2012 at 2:59