shuyo / language-detection

This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)
https://github.com/shuyo/language-detection/blob/wiki/ProjectHome.md
732 stars 184 forks source link

language is not detecting correctly #80

Open aslamhirani opened 8 years ago

aslamhirani commented 8 years ago

For below text, the language is show as english but it has two languages? "SuccessFactors——SAP人力资源管理解决方案介绍"

arnt commented 8 years ago

This language detection library answers the question "which language is this text?", which implicitly limits its input to texts for which the question makes sense. This excludes many strings, for example "Benito Mussolini", "1. e4e5 2. Nf3d6", "Willkommen, bienvenue, welcome", "https://www.youtube.com/watch?v=yQn8vPOCnp4" and "". The question simply doesn't apply to all these strings.