AI4Bharat / indicTrans

indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
https://ai4bharat.iitm.ac.in/indic-trans
MIT License
116 stars 31 forks source link

Added a section in finetuning colab for indic2indic mining #12

Closed gowtham1997 closed 3 years ago

gowtham1997 commented 3 years ago

I also left a note on how to remove duplicates and overlaps for the generated indic2indic data. But will document this more in the coming days as part of the training documentation

anoopkunchukuttan commented 3 years ago

Ok, m2m extraction documentation is useful.