lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0
935 stars 214 forks source link

Add the ReazonSpeech recipe #1330

Closed Triplecq closed 4 months ago

Triplecq commented 5 months ago

ReazonSpeech is an open-source dataset that contains a diverse set of natural Japanese speech, collected from terrestrial television streams. It contains more than 35,000 hours of audio.

The dataset is available on Hugging Face. For more details, please visit:

pzelasko commented 5 months ago

Also, could you add an entry in the dataset table in docs/corpus.rst?

Triplecq commented 5 months ago

Thanks for your quick feedback and suggestions! I will get back to your comments soon. :)

Triplecq commented 4 months ago

Thanks for your quick feedback! I'll fix those checks very soon. :)

pzelasko commented 4 months ago

It seems the tests are failing on importing num2words, can you make it into a local import guarded by is_module_available (pls search lhotse sources for is_module_available to see an example of import guard for optional dependencies).

Triplecq commented 4 months ago

@pzelasko Thanks for the note! I've already changed it to local import. Please feel free to let me know if there's anything else I need to improve. :)