IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
Closes #275 #16 #274 | Create dataset loader for INDspeech_TELDIALOG_LVCSR, Liputan6, & INDspeech_NEWS_EthnicSR #299

ziweiji closed 1 year ago

ziweiji commented 1 year ago

Please name your PR after the issue it closes. You can use the following line: "Closes #ISSUE-NUMBER" where you replace the ISSUE-NUMBER with the one corresponding to your dataset.

ziweiji commented 1 year ago

python -m tests.test_nusantara nusacrowd/nusa_datasets/indsp_teldialog_lvcsr/indsp_teldialog_lvcsr.py --subset_id indsp_teldialog_lvcsr
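The command above follows the pattern `nusacrowd/nusa_datasets/<dataset>/<dataset>.py --subset_id <subset_id>`. As a rough sketch, a `/test dataset=... subset_id=...` comment like the ones in this thread could be expanded into the same kind of command. The helper name `build_test_command` is hypothetical, and the mapping from `/test` comments to this command is an assumption; the thread does not show how the CI bot actually does it.

```python
from typing import List


def build_test_command(comment: str) -> List[str]:
    """Expand a '/test key=value ...' comment into an argv list.

    Hypothetical helper: assumes the loader script lives at
    nusacrowd/nusa_datasets/<dataset>/<dataset>.py, as in the command above.
    """
    parts = comment.strip().split()
    if not parts or parts[0] != "/test":
        raise ValueError("not a /test comment")

    # Parse the remaining 'key=value' tokens into a dict.
    args = dict(token.split("=", 1) for token in parts[1:])
    dataset = args["dataset"]

    cmd = [
        "python", "-m", "tests.test_nusantara",
        f"nusacrowd/nusa_datasets/{dataset}/{dataset}.py",
    ]
    # Only pass --subset_id when the comment names one.
    if "subset_id" in args:
        cmd += ["--subset_id", args["subset_id"]]
    return cmd


if __name__ == "__main__":
    print(" ".join(build_test_command(
        "/test dataset=liputan6 subset_id=liputan6_canonical")))
```

When no `subset_id` is given, the sketch simply omits the flag and leaves the default to the test module.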

SamuelCahyawijaya commented 1 year ago

/test dataset=liputan6

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3160588702

SamuelCahyawijaya commented 1 year ago

/test dataset=liputan6 subset_id=liputan6_canonical

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3160615466

SamuelCahyawijaya commented 1 year ago

/test dataset=indsp_teldialog_lvcsr

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3160655781

SamuelCahyawijaya commented 1 year ago

/test dataset=indsp_teldialog_lvcsr

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3160710538

SamuelCahyawijaya commented 1 year ago

The liputan6 & indsp_teldialog_lvcsr datasets look good, btw.

holylovenia commented 1 year ago

/test dataset=indsp_news_ethnicsr subset_id=indsp_news_ethnicsr_Jawa_1

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3167636054

gentaiscool commented 1 year ago

@ziweiji Hi, I was wondering if you could make the minor changes requested by @holylovenia. It would be great if you could. Thanks!

SamuelCahyawijaya commented 1 year ago

/test dataset=indsp_news_ethnicsr subset_id=indsp_news_ethnicsr_jv_overlap

SamuelCahyawijaya commented 1 year ago

/test dataset=indsp_news_ethnicsr subset_id=indsp_news_ethnicsr_jv_nooverlap