Add GigaSpeech 2 recipe

lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.

Apache License 2.0

935 stars 214 forks source link

This PR adds a recipe for GigaSpeech 2. GigaSpeech 2 raw comprises about 30,000 hours of automatically transcribed speech across Thai, Indonesian, and Vietnamese. GigaSpeech 2 refined consists of 10,000 hours of Thai, 6,000 hours each for Indonesian and Vietnamese. GigaSpeech 2 test sets more realistically reflect speech recognition scenarios and mirror the real performance of an ASR system for low-resource languages.

For more details, please visit: Dataset: https://huggingface.co/datasets/speechcolab/gigaspeech2 Preprint paper: https://arxiv.org/pdf/2406.11546

lhotse-speech / lhotse

Add GigaSpeech 2 recipe #1365