AI4Bharat
/
indicnlp_catalog
A collaborative catalog of NLP resources for Indic languages
https://ai4bharat.github.io/indicnlp_catalog
531
stars
77
forks
issues
Parallel corpora for NE languages from NIT Silchar
#201
anoopkunchukuttan
opened
1 year ago
0
Code-Mixed medical conversations in the Telugu-English language.
#200
anoopkunchukuttan
closed
1 year ago
1
Issues: # 62,64,67,69,70 addressed
#199
sangeeta-anoop
closed
1 year ago
0
SkitAI Speech Intent Classification
#198
GokulNC
opened
1 year ago
0
Changed the link for DNLP-Tel corpora
#197
neshkatrapati
closed
1 year ago
1
Issue :#93,88,80,77 addressed.
#196
sangeeta-anoop
closed
1 year ago
0
Issues #103,104,92 addressed.
#195
sangeeta-anoop
closed
1 year ago
0
Hindi Hate Speech Evaluation dataset
#194
anoopkunchukuttan
closed
1 year ago
1
Issue: #116,130 addressed
#193
sangeeta-anoop
closed
1 year ago
0
Issues: #164,152,134,130 addressed.
#192
sangeeta-anoop
closed
1 year ago
2
Issues: # 160,167,168,169 addressed.
#191
sangeeta-anoop
closed
1 year ago
0
Issue #142, 143 addressed.
#190
sangeeta-anoop
closed
1 year ago
0
BigScience BLOOM model
#189
anoopkunchukuttan
closed
1 year ago
0
Meta NLLB resources
#188
anoopkunchukuttan
closed
1 year ago
1
# 184 addressed and updated links to point to the new AI4Bharat website.
#187
sangeeta-anoop
closed
1 year ago
0
CSTD-Telugu ASR Corpus
#186
GokulNC
opened
1 year ago
3
Issue# 181, 178, 165 , IndicXLIT, Aksharantar
#185
sangeeta-anoop
closed
1 year ago
0
IISc-MILE ASR Corpus
#184
GokulNC
closed
1 year ago
1
Issue #182
#183
sangeeta-anoop
closed
1 year ago
0
AI4Bharat IndicNER and Naamapadam: NER dataset & model for 11 Indic languages
#182
anoopkunchukuttan
closed
1 year ago
0
AsNER: Assamese NER dataset and model
#181
anoopkunchukuttan
closed
1 year ago
0
L3Cube-MahaSent
#180
anoopkunchukuttan
opened
1 year ago
0
L3Cube-MahaHate
#179
anoopkunchukuttan
opened
1 year ago
0
L3Cube-MahaNER: Marathi NER dataset
#178
anoopkunchukuttan
closed
1 year ago
0
CEnTam Corpus
#177
sanjanasri
opened
1 year ago
1
Is there a plan to add 'Punjabi' language dataset to the corpus?
#176
PrabhjotKaurGosal
opened
1 year ago
2
Bengali Visual Genome
#175
anoopkunchukuttan
opened
1 year ago
0
Corpora for extremely low-resource languages
#174
anoopkunchukuttan
opened
1 year ago
0
Crubadan Project
#173
anoopkunchukuttan
opened
1 year ago
0
English-Bodo parallel corpus
#172
anoopkunchukuttan
opened
1 year ago
0
KMI Bodo monolingual corpus
#171
anoopkunchukuttan
closed
1 year ago
1
IWN Word List
#170
maharajbrahma
opened
2 years ago
0
XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages
#169
anoopkunchukuttan
closed
1 year ago
0
XL-SUM
#168
anoopkunchukuttan
closed
1 year ago
1
MASSIVE SLU/NLU dataset from Amazon
#167
anoopkunchukuttan
closed
1 year ago
0
ATIS Spoken Language Understanding (SLU) dataset for Hindi
#166
anoopkunchukuttan
opened
2 years ago
0
HiNER: Hindi NER dataset
#165
anoopkunchukuttan
closed
1 year ago
1
Bangla2B+ monolingual corpus and new Bengali benchmarks
#164
GokulNC
opened
2 years ago
0
IIIT-H OCR benchmark for Gujarati & Tamil
#163
GokulNC
opened
2 years ago
0
Devanagari scene-text videos
#162
GokulNC
opened
2 years ago
0
Sinhala TTS - PathNirvana dataset
#161
GokulNC
opened
2 years ago
0
Tham Khasi corpus
#160
anoopkunchukuttan
closed
1 year ago
0
Aspect Based Sentiment Analysis in Hindi
#159
Pruthwik
opened
2 years ago
0
IndicLink dataset -- Multilingual Fact Linking on Knowledge Graphs
#158
GokulNC
opened
2 years ago
0
CVSS Speech Translation (synthetic) dataset
#157
GokulNC
opened
2 years ago
0
IndicSynthText
#156
GokulNC
opened
2 years ago
0
Tamil Treebank
#155
anoopkunchukuttan
opened
2 years ago
0
HLDC: Hindi Legal Corpus
#154
anoopkunchukuttan
opened
2 years ago
0
Dravidian language parallel corpora from DravidianLangTech workshop 2022@ACL
#153
anoopkunchukuttan
opened
2 years ago
0
Ema-lon Manipuri Corpus
#152
GokulNC
closed
1 year ago
0