goru001 / inltk

Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
https://inltk.readthedocs.io
MIT License
824 stars 163 forks source link

How to identify whether the text is Hinglish? #76

Open codemaster-22 opened 3 years ago

codemaster-22 commented 3 years ago

Can you please tell us is there any way to identify if the text is Hinglish ? I went through the code of identify_language() and u are outputting en if we type something in English? Can you please reply asap I am in need of a model which identifies whether it's Hinglish or not.

goru001 commented 3 years ago

Hi @codemaster-22 , thanks for reaching out. Currently for all code-mixed languages in latin script, "en" is returned. There's this open issue as well for it.