microsoft / MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
MIT License
468 stars 142 forks source link

How do I generate audio data with fixed audio length? #21

Open Superklez opened 3 years ago

Superklez commented 3 years ago

Is there a way to generate audio data with fixed audio length? I know there's audio_length that specifies the minimum length of each audio clip, but is there a way to specify the maximum audio length?