Closed by andreabisello 1 year ago
Yes, this is certainly possible. For strings like these, the sum of the n-gram probabilities for Italian will be larger than the sum of the n-gram probabilities for English, so Italian is returned. This is not a bug in the library. A statistical approach is never 100% correct.
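To make the effect concrete, here is a minimal sketch of n-gram scoring. It is not the library's actual code: the trigram probabilities, the `UNSEEN` floor, and the function names are all made up for illustration, and summing log-probabilities is an assumption about how such a score is typically combined.

```python
import math

# Toy trigram probabilities, invented for illustration only.
# A real detector would learn these from large text corpora.
TOY_MODELS = {
    "english": {"pia": 0.001, "ian": 0.004, "ano": 0.002},
    "italian": {"pia": 0.006, "ian": 0.003, "ano": 0.009},
}
UNSEEN = 1e-6  # assumed floor for trigrams missing from a model


def trigrams(text):
    """Return all overlapping 3-letter sequences, ignoring non-letters."""
    letters = "".join(ch for ch in text.lower() if ch.isalpha())
    return [letters[i:i + 3] for i in range(len(letters) - 2)]


def score(text, model):
    """Sum the log-probabilities of the text's trigrams under one model."""
    return sum(math.log(model.get(g, UNSEEN)) for g in trigrams(text))


def detect(text):
    """Return the best-scoring language and the per-language scores."""
    scores = {lang: score(text, model) for lang, model in TOY_MODELS.items()}
    return max(scores, key=scores.get), scores


language, scores = detect("piano")
print(language, scores)
```

With these toy numbers, "piano" scores higher for Italian even though it is also a valid English word, because each score depends only on how common the letter sequences are in each language. Short inputs give the model very few n-grams to work with, so a single frequent sequence can decide the result.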
These strings are detected as Italian even though they are in English (and there is no Italian word in them).