AI4Bharat / indicnlp_catalog

A collaborative catalog of NLP resources for Indic languages
https://ai4bharat.github.io/indicnlp_catalog
552 stars 79 forks source link

Bangla2B+ monolingual corpus and new Bengali benchmarks #164

Open GokulNC opened 2 years ago

GokulNC commented 2 years ago

Paper: https://arxiv.org/abs/2101.00204 Data: https://github.com/csebuetnlp/banglabert#datasets (Not yet released maybe)