Open avitalp opened 9 years ago
A language identification module that implements the TextCat algorithm, using language n-grams from the "An Crubadan" project. Written in response to https://github.com/nltk/nltk/issues/107 and built on https://github.com/nltk/nltk/pull/845
The method "demo" refers to several sample files which I didn't include, as I was not sure where they should be placed.
@alexrudnick: would you be able to provide sample texts for some of the less well-represented languages?
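For readers unfamiliar with TextCat, here is a minimal sketch of the general idea (Cavnar and Trenkle's ranked n-gram profiles compared with an "out-of-place" distance). It is not the code in this PR: the function names `ngram_profile`, `out_of_place`, and `guess_language` are hypothetical, and the toy profiles are built from short sample strings rather than from the An Crubadan data.

```python
from collections import Counter


def ngram_profile(text, n_max=3, top_k=300):
    """Return the top_k character n-grams (n = 1..n_max), most frequent first."""
    counts = Counter()
    for token in text.lower().split():
        padded = f"_{token}_"  # mark word boundaries with underscores
        for n in range(1, n_max + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count; ties are broken alphabetically for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams missing from lang_profile get a fixed penalty."""
    lang_rank = {gram: rank for rank, gram in enumerate(lang_profile)}
    max_penalty = len(lang_profile)
    return sum(
        abs(rank - lang_rank[gram]) if gram in lang_rank else max_penalty
        for rank, gram in enumerate(doc_profile)
    )


def guess_language(text, lang_profiles):
    """Return the language whose profile is closest to the text's profile."""
    doc_profile = ngram_profile(text)
    return min(lang_profiles, key=lambda lang: out_of_place(doc_profile, lang_profiles[lang]))


# Toy profiles for illustration only (assumption: real use would load corpus-derived profiles).
samples = {
    "en": "the quick brown fox jumps over the lazy dog",
    "de": "der schnelle braune fuchs springt ueber den faulen hund",
}
profiles = {lang: ngram_profile(text) for lang, text in samples.items()}
print(guess_language(samples["en"], profiles))
```

Profiles this small are only good for showing the mechanics; with real corpus-derived n-gram lists the same distance becomes far more discriminating.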
Thanks @avitalp. I am considering putting this in nltk/classify.
Thanks @stevenbird, that'd be great. Is there anything you'd like me to modify or add for that?