Mimino666 / langdetect

Port of Google's language-detection library to Python.
Other
1.71k stars 196 forks source link

Idea: Unicode language space detection #66

Open DonaldTsang opened 4 years ago

DonaldTsang commented 4 years ago

This might be useful for detection what language a string ISN'T, since for my use case I would like to block out Unicode codepoints that does not belong within a specific language. Note: The strings are singular or few word "image metadata tags".