Create dataset loader for JV-ID ASR

NusaCatalogue: https://indonlp.github.io/nusa-catalogue/card.html?jv_id_asr

Dataset	jv_id_asr
Description	This data set contains transcribed audio data for Javanese. The data set consists of wave files, and a TSV file. The file utt_spk_text.tsv contains a FileID, UserID and the transcription of audio in the file. The data set has been manually quality checked, but there might still be errors. This dataset was collected by Google in collaboration with Reykjavik University and Universitas Gadjah Mada in Indonesia.
License	CC-BY-SA 4.0

Dataset

jv_id_asr

Description

This data set contains transcribed audio data for Javanese. The data set consists of wave files, and a TSV file. The file utt_spk_text.tsv contains a FileID, UserID and the transcription of audio in the file. The data set has been manually quality checked, but there might still be errors. This dataset was collected by Google in collaboration with Reykjavik University and Universitas Gadjah Mada in Indonesia.

License

CC-BY-SA 4.0

IndoNLP / nusa-crowd

Create dataset loader for JV-ID ASR #282

self-assign