IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
261 stars 61 forks source link

Create dataset loader for INDspeech_DIGIT_CDSR #278

Closed SamuelCahyawijaya closed 1 year ago

SamuelCahyawijaya commented 1 year ago

NusaCatalogue: https://indonlp.github.io/nusa-catalogue/card.html?indspeech_digit_cdsr

Dataset indspeech_digit_cdsr
Description INDspeech_DIGIT_CDSR is the first Indonesian speech dataset for connected digit speech recognition (CDSR). The data was developed by TELKOMRisTI (R&D Division, PT Telekomunikasi Indonesia) in collaboration with Advanced Telecommunication Research Institute International (ATR) Japan and Bandung Institute of Technology (ITB) under the Asia-Pacific Telecommunity (APT) project in 2004 [Sakti et al., 2004]. Although it was originally developed for a telecommunication system for hearing and speaking impaired people, it can be used for other applications, i.e., automatic call centers that recognize telephone numbers.
License CC-BY-NC-SA 4.0
IvanHalimP commented 1 year ago

self-assign

ziweiji commented 1 year ago

self-assign

holylovenia commented 1 year ago

Hi @ziweiji, apparently @IvanHalimP has self-assigned first and he already made a pull request, so there's no need for you to work on this dataloader. cc: @SamuelCahyawijaya