SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0

Create dataset loader for Gowajee Corpus #586

Closed: SamuelCahyawijaya closed this issue 1 month ago

SamuelCahyawijaya commented 3 months ago

Dataloader name: gowajee/gowajee.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?gowajee

Dataset: gowajee
Description: The Gowajee corpus was collected in the Automatic Speech Recognition class offered at Chulalongkorn University as a homework assignment. Each group was asked to come up with an example smart home application.
Subsets: -
Languages: tha
Tasks: Automatic Speech Recognition
License: MIT (mit)
Homepage: https://github.com/ekapolc/gowajee_corpus
HF URL: -
Paper URL: https://github.com/ekapolc/gowajee_corpus?tab=readme-ov-file

akhdanfadh commented 1 month ago

self-assign