Open goru001 opened 4 years ago
@Shubhamjain27 Will you be able to take this up?
Please help me, I would like to train the Telugu model. I can see the language model file in NLP for Telugu. Where is the separate model located? I am a Telugu speaker.
@goru001 If no one is working on this, I can take it up. We can use pycld2 or pycld3; they identify all the supported languages except Oriya, Bengali and Sanskrit.
I have used the same in my own projects and it's also used by polyglot's language detection. https://github.com/aboSamoor/polyglot/blob/d0d2aa8/polyglot/detect/base.py#L72
What do you think?
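A minimal sketch of how a pycld2-based check could look, not the library's actual implementation. It relies on `pycld2.detect`, which returns `(is_reliable, bytes_found, details)`, where each entry in `details` is `(language_name, language_code, percent, score)`. The `detect_language` helper, its `detector` parameter and the `SUPPORTED_CODES` set are hypothetical names introduced for illustration; the real list of supported languages would come from the project itself.

```python
from typing import Callable, Optional, Set

# Hypothetical subset of language codes, for illustration only.
SUPPORTED_CODES: Set[str] = {"hi", "te", "ta", "pa", "gu", "kn", "ml", "mr", "ne", "ur"}


def detect_language(
    text: str,
    detector: Optional[Callable] = None,
    supported: Set[str] = SUPPORTED_CODES,
) -> Optional[str]:
    """Return the detected language code, or None if unreliable or unsupported."""
    if detector is None:
        # Imported lazily so the helper can be exercised without pycld2 installed.
        import pycld2

        detector = pycld2.detect

    # pycld2.detect returns (is_reliable, bytes_found, details).
    is_reliable, _bytes_found, details = detector(text)
    if not is_reliable or not details:
        return None

    # The first entry in details is the most likely language:
    # (language_name, language_code, percent, score).
    _name, code, _percent, _score = details[0]
    return code if code in supported else None
```

With pycld2 installed, `detect_language(some_telugu_text)` would call `pycld2.detect` directly. Languages the detector does not cover (Oriya, Bengali and Sanskrit, per the comment above) would still need a separate fallback.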
@lordzuko That'll be great! Feel free to raise a PR for this.
@goru001 Can I take this issue up if it is still unresolved?
@nitkannen Yes sure, this is still unresolved and it'll be great if you can contribute!
Sure @goru001
@goru001 Can you give me some guidance on where to start with retraining the Telugu model? Any notebooks or scripts used for other languages, along with the data, would be really helpful.
The identify languages function, which uses a separate model to identify languages, hasn't been retrained on Telugu in v0.9. It needs to be retrained to support Telugu.