alexandrainst / coral

Danish ASR and TTS models associated with the CoRal project.
MIT License

Add NST dataset #27

Closed: saattrupdan closed this 1 year ago

saattrupdan commented 1 year ago

This PR adds the Danish NST dataset. It adds a build_nst_da script that downloads the dataset and uploads it to the Hugging Face Hub (the upload has already been done), and it includes the dataset as a dataset config.

This closes #11

AJDERS commented 1 year ago

This may be outside the scope of this PR (or maybe not, judging by the PR's name), but I think it could be useful in the future to have the other parts of the NST dataset as well, no?

Maybe rename the script to build_nst.py, change the URL dict to a dict of dicts keyed by language, and add a language flag?
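
A minimal sketch of that suggestion, assuming a command-line script built on argparse. Everything here is hypothetical: the names `NST_URLS`, `get_urls`, and `parse_args`, the split names, the language codes, and the placeholder URLs are illustrative and are not taken from the actual build_nst_da script.

```python
import argparse
from typing import Dict, List, Optional

# Hypothetical dict of dicts: language code -> {split name -> download URL}.
# The URLs are placeholders, not the real NST download locations.
NST_URLS: Dict[str, Dict[str, str]] = {
    "da": {
        "train": "https://example.com/nst/da/train.tar.gz",
        "test": "https://example.com/nst/da/test.tar.gz",
    },
    "sv": {
        "train": "https://example.com/nst/sv/train.tar.gz",
        "test": "https://example.com/nst/sv/test.tar.gz",
    },
    "no": {
        "train": "https://example.com/nst/no/train.tar.gz",
        "test": "https://example.com/nst/no/test.tar.gz",
    },
}


def get_urls(language: str) -> Dict[str, str]:
    """Return the split -> URL mapping for one language."""
    if language not in NST_URLS:
        supported = ", ".join(sorted(NST_URLS))
        raise ValueError(
            f"Unsupported language {language!r}; expected one of: {supported}"
        )
    return NST_URLS[language]


def parse_args(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse the command line; Danish stays the default."""
    parser = argparse.ArgumentParser(description="Build one part of the NST dataset.")
    parser.add_argument(
        "--language",
        choices=sorted(NST_URLS),
        default="da",
        help="Which language part of NST to build.",
    )
    return parser.parse_args(argv)
```

With something like this, the rest of the script would look up `get_urls(args.language)` instead of reading a flat URL dict, so adding another language only means adding another entry to `NST_URLS`.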