anoopkunchukuttan / indic_nlp_library

Resources and tools for Indian language Natural Language Processing
http://anoopkunchukuttan.github.io/indic_nlp_library/
MIT License
546 stars 158 forks source link

Detect the language of transliterated text #33

Closed bnriiitb closed 3 years ago

bnriiitb commented 3 years ago

Is there any functionality to detect the language of a transliterated text?

anoopkunchukuttan commented 3 years ago

No, there is no functionality today. Do you have sufficient transliterated text in the language. It will be easy to train a language identificerif you have such data.

bnriiitb commented 3 years ago

Thank you @anoopkunchukuttan for the response. I have a lot of transliterated text but the concern is it's not tagged and also some times a single text has transliterated words of multiple languages