FGRibreau / node-language-detect

🇫🇷 NodeJS language detection library using n-gram
http://blog.fgribreau.com/2011/07/week-end-project-nodejs-language.html
MIT License
397 stars, 45 forks

Detect Cyrillic / Japanese / Thai / Gurmukhi #2

Closed FGRibreau closed 12 years ago

FGRibreau commented 13 years ago

Cyrillic, Hiragana, Thai, Gurmukhi:

var text = tweet.getStatus().text;

// 0400 - 04FF: Cyrillic
// 3040 - 309F: Hiragana (Japanese)
// 0E00 - 0E7F: Thai
// 0A00 - 0A7F: Gurmukhi (Indian)
return !(/[\u0400-\u04FF\u3040-\u309F\u0E00-\u0E7F\u0A00-\u0A7F]+/.test(text));
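The snippet above only answers whether any of the four scripts is present. As a rough sketch of how the same Unicode ranges could also report which script matched, something like the following might work. The names `SCRIPT_RANGES`, `detectScripts`, and `hasNoListedScript` are hypothetical and are not part of node-language-detect's API.

```javascript
// Hypothetical helper, for illustration only; not part of node-language-detect.
// Each entry pairs a script name with the Unicode block quoted in the issue.
const SCRIPT_RANGES = [
  { name: "Cyrillic", pattern: /[\u0400-\u04FF]/ },
  { name: "Hiragana", pattern: /[\u3040-\u309F]/ },
  { name: "Thai", pattern: /[\u0E00-\u0E7F]/ },
  { name: "Gurmukhi", pattern: /[\u0A00-\u0A7F]/ },
];

// Returns the names of every listed script that occurs at least once in the text.
function detectScripts(text) {
  return SCRIPT_RANGES
    .filter((range) => range.pattern.test(text))
    .map((range) => range.name);
}

// Same result as the combined regex in the issue:
// true when none of the four scripts appears in the text.
function hasNoListedScript(text) {
  return detectScripts(text).length === 0;
}
```

Note that the Hiragana block alone does not cover Katakana (U+30A0 - U+30FF) or kanji, so Japanese text written without Hiragana would not be flagged by this check.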