Open Rikorose opened 2 years ago
Would you please fix this issue at your earliest convenience? Many thanks.
The VocalSet release on Zenodo would be helpful.
@nicriverhoo you could fix this issue by overriding the sampling rate with SoX. For example, you could run this command in the VocalSet directory:
find ./ -name "*.wav" -exec sox -r 16k {} ../singing_voice_16k/{} \;
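If SoX is not available, the same relabeling can be sketched in Python with the standard-library `wave` module. This is a minimal sketch, not a tested fix for the dataset: it assumes the files are plain PCM WAV (which is what `wave` can read), that the true rate is the 16 kHz stated in the issue, and the function names `relabel_wav` and `relabel_tree` are made up for illustration. It only rewrites the rate in the header and copies the samples unchanged; it does not resample.

```python
import wave
from pathlib import Path

TRUE_RATE_HZ = 16_000  # assumption: the actual rate reported in this issue


def relabel_wav(src: Path, dst: Path, rate_hz: int = TRUE_RATE_HZ) -> None:
    """Copy src to dst, changing only the sampling rate stored in the header."""
    with wave.open(str(src), "rb") as reader:
        params = reader.getparams()
        frames = reader.readframes(reader.getnframes())
    # Create the output subdirectory if it does not exist yet.
    dst.parent.mkdir(parents=True, exist_ok=True)
    with wave.open(str(dst), "wb") as writer:
        writer.setparams(params._replace(framerate=rate_hz))
        writer.writeframes(frames)  # samples are written back untouched


def relabel_tree(src_root: Path, dst_root: Path, rate_hz: int = TRUE_RATE_HZ) -> int:
    """Relabel every .wav under src_root, mirroring the directory layout."""
    count = 0
    for src in sorted(src_root.rglob("*.wav")):
        relabel_wav(src, dst_root / src.relative_to(src_root), rate_hz)
        count += 1
    return count
```

Called as `relabel_tree(Path("."), Path("../singing_voice_16k"))` from the VocalSet directory, this mirrors the directory layout used in the command above. Keep the output directory outside the input tree so relabeled files are not picked up again on a second run.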
The samples provided in
datasets_fullband/clean_fullband/VocalSet_48kHz_mono
have a reported sampling rate of 48 kHz. The real sampling rate, however, is 16 kHz, which results in a Mickey Mouse-type voice (i.e. a pitch 3 times higher than the original).
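The factor of 3 follows directly from the two rates: a player that trusts the 48 kHz header runs through 16 kHz samples three times too fast. A small sketch of that arithmetic is below; the function name `mislabel_effect` is hypothetical, and the two rates are the ones quoted in this issue.

```python
import math


def mislabel_effect(reported_hz: float, actual_hz: float) -> tuple[float, float, float]:
    """Return (speed factor, pitch shift in semitones, duration scale)."""
    speed = reported_hz / actual_hz      # playback runs this many times too fast
    semitones = 12 * math.log2(speed)    # equal-tempered pitch shift
    return speed, semitones, 1 / speed   # clips last 1/speed of their true length


speed, semitones, duration = mislabel_effect(48_000, 16_000)
print(f"{speed:g}x faster, +{semitones:.1f} semitones, {duration:.2f}x duration")
# prints "3x faster, +19.0 semitones, 0.33x duration"
```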
To reproduce with sox: