AI4Bharat/indicnlp_catalog
A collaborative catalog of NLP resources for Indic languages
https://ai4bharat.github.io/indicnlp_catalog
552 stars, 79 forks
issues
Dravidian language parallel corpora from DravidianLangTech workshop 2022@ACL
#153
anoopkunchukuttan
opened
2 years ago
0
Ema-lon Manipuri Corpus
#152
GokulNC
closed
2 years ago
0
Parallel Corpora for 6 Indian Languages
#151
maharajbrahma
closed
2 years ago
2
English-Punjabi Code-Mixed Social Media Content
#150
maharajbrahma
opened
2 years ago
1
Prabhupadavani: A Code-mixed Speech Translation Data
#149
GokulNC
opened
2 years ago
0
AI4Bharat Listing of Indian language websites
#148
anoopkunchukuttan
opened
2 years ago
0
Kokborok raw data source: TRCI
#147
anoopkunchukuttan
opened
2 years ago
0
AI4Bharat Cross-lingual Semantic Textual Similarity
#146
anoopkunchukuttan
closed
2 years ago
4
Added Chaii QnA dataset in the QA subsection
#145
ritwikmishra
closed
2 years ago
1
Addressed issue #109
#144
sangeeta-anoop
closed
2 years ago
0
MultiCoNER: Multilingual Complex Named Entity Recognition
#143
anoopkunchukuttan
closed
2 years ago
0
AI4Bharat IndicBART
#142
anoopkunchukuttan
closed
2 years ago
0
Addressed issue #129
#141
sangeeta-anoop
closed
2 years ago
0
Changes for Issue #124
#139
sangeeta-anoop
closed
2 years ago
1
Updated coreference dataset URLs
#138
ritwikmishra
closed
2 years ago
1
BUILD Indian Legal Data Benchmark
#137
anoopkunchukuttan
opened
2 years ago
0
SentNoB: A Dataset for Analysing Sentiment on Noisy Bangla Texts
#136
anoopkunchukuttan
closed
2 years ago
0
IndoRE - Relation Extraction for three low resource Indian Languages
#135
GokulNC
opened
2 years ago
0
SinMin Corpus - Sinhala Monolingual data
#134
GokulNC
closed
2 years ago
1
WikiLingua+GlobalVoices: Abstractive Summarization Dataset
#133
GokulNC
opened
2 years ago
3
Pavlick Bilingual Dictionaries
#132
GokulNC
opened
2 years ago
0
Universal Romanizer
#131
GokulNC
opened
2 years ago
0
PHINC: A Parallel Hinglish Social Media Code-Mixed Corpus for Machine Translation
#130
GokulNC
closed
2 years ago
3
Request to add Hindi Reading Comprehension dataset
#129
erzaliator
closed
2 years ago
1
Hindi Xposition
#128
GokulNC
opened
3 years ago
0
Indian Sign Language Resources
#127
GokulNC
opened
3 years ago
0
IIIT-D Multilingual Abusive Comment Identification
#126
GokulNC
opened
3 years ago
1
fixing link to IIIT-H treebank
#125
oligoglot
closed
2 years ago
4
Indian language Cognate Datasets
#124
anoopkunchukuttan
closed
2 years ago
0
Sindhi Resources
#123
GokulNC
opened
3 years ago
0
Punctuation Restoration Models
#122
GokulNC
opened
3 years ago
0
Indic Dictionaries in Babylon (StarDict) format
#121
GokulNC
opened
3 years ago
0
Ek-Step ULCA ASR Corpus
#120
GokulNC
opened
3 years ago
0
WikTra - Wiktionary Transliteration modules for 181 languages
#119
GokulNC
opened
3 years ago
0
Word Phonemizer
#118
GokulNC
opened
3 years ago
0
MuCS 2021: MUltilingual and Code-Switching ASR
#117
GokulNC
opened
3 years ago
0
KMI Linguistics
#116
GokulNC
opened
3 years ago
0
Dhivehi Resources
#115
GokulNC
opened
3 years ago
0
WIT : Wikipedia-based Image Text Dataset
#114
GokulNC
opened
3 years ago
0
Is there any way to improve misspelled words in a sentence for Hindi language
#113
surisettynagaraju
opened
3 years ago
1
CoRSAL archive - raw audio, text for low resource Indian languages
#112
gowtham1997
opened
3 years ago
0
Bengali Hugging Face wav2vec2-based ASR model
#111
arijitx
closed
3 years ago
1
QA dataset for Bengali
#110
anoopkunchukuttan
closed
2 years ago
2
Hindi Distractor Generator dataset
#109
anoopkunchukuttan
closed
2 years ago
0
Bengali Abstractive News Summarization
#108
anoopkunchukuttan
opened
3 years ago
0
Various Sanskrit text sources
#107
anoopkunchukuttan
opened
3 years ago
2
ML-based Schwa deletion tool for Hindi & Punjabi
#106
GokulNC
opened
3 years ago
0
Added Vāksañcayaḥ, a corpus for Sanskrit ASR
#105
krishnamrith12
closed
3 years ago
0
Sanskrit-English MT data
#104
GokulNC
opened
3 years ago
1
Hindi-Kangri dataset
#103
anoopkunchukuttan
closed
2 years ago
2