Alorese Collection or Alorese Corpus is a collection of language data in a couple of Alorese variation (Alor and Pantar Alorese). The collection is available in video, audio, and text formats with genres ranging from Experiment or task, Stimuli, Discourse, and Written materials.
Subsets
-
Languages
aol, ind
Tasks
Language Modeling, Automatic Speech Recognition, Machine Translation
Dataloader name:
alorese/alorese.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?alorese