k2-fsa / icefall

https://k2-fsa.github.io/icefall/
Apache License 2.0
792 stars 267 forks source link

Zipformer recipe for ReazonSpeech #1611

Open Triplecq opened 2 weeks ago

Triplecq commented 2 weeks ago

ReazonSpeech is an open-source dataset that contains a diverse set of natural Japanese speech, collected from terrestrial television streams. It contains more than 35,000 hours of audio.

The dataset is available on Hugging Face. For more details, please visit:

danpovey commented 2 weeks ago

There are quite a few changes not in the directory you are adding. You might want to remove those as they are potential barriers to merging it. If there's anything outside that directory you believe we should change , it can be a separate PR.

Triplecq commented 2 weeks ago

There are quite a few changes not in the directory you are adding. You might want to remove those as they are potential barriers to merging it. If there's anything outside that directory you believe we should change , it can be a separate PR.

Thanks for your quick feedback during the holiday! I will remove unrelated changes and get back to you soon.

Triplecq commented 2 weeks ago

I've already removed those unrelated changes. It's ready for review now. Please let me know if you have any questions or comments. Thank you!

pzelasko commented 2 weeks ago

I noticed that you have „lhotse prepare reazonspeech” command in data prep, do you intend to submit a PR to Lhotse as well?

Triplecq commented 2 weeks ago

I noticed that you have „lhotse prepare reazonspeech” command in data prep, do you intend to submit a PR to Lhotse as well?

Thanks for the note. Sure, we're cleaning up the scripts and will submit a PR to Lhotse soon. :)

Triplecq commented 2 weeks ago

I just submitted a PR to Lhotse as well: https://github.com/lhotse-speech/lhotse/pull/1330 Both PR are ready for review. Thank you!