The VocelSet_48kHz_mono dataset files are encoded with a 48 kHz sample rate, but when played back, the voices are even higher pitched than my 3-year-old son.
Playback at 16 kHz sounds about right.
I suggest re-encoding the files with their correct sample rate of 16 kHz, or upsampling them to 48 kHz if a consistent sample rate is required for the dataset.
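For anyone who wants a local workaround in the meantime, here is a minimal sketch of the first option (relabeling the files as 16 kHz without touching the samples). It assumes the files are uncompressed PCM WAV, which Python's standard `wave` module can read; the function name `relabel_sample_rate` and the file paths are made up for illustration. Note that this only changes the rate declared in the header. It does not resample, so it is not a substitute for the second option (a real upsample to 48 kHz).

```python
import wave


def relabel_sample_rate(src_path, dst_path, new_rate=16000):
    """Copy a PCM WAV file, changing only the sample rate in the header.

    The audio samples are written back unchanged, so the file plays
    slower or faster depending on new_rate. Hypothetical helper, not
    part of the dataset's tooling.
    """
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames = src.readframes(params.nframes)

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(params.nchannels)
        dst.setsampwidth(params.sampwidth)
        dst.setframerate(new_rate)  # the only value that differs from the source
        dst.writeframes(frames)

    # Return the old and new rates so the caller can log what changed.
    return params.framerate, new_rate


# Example call (paths are placeholders):
# relabel_sample_rate("input_48k.wav", "output_16k.wav", 16000)
```

If the samples really were recorded at 16 kHz, playing them at 48 kHz runs them three times too fast, which would explain the pitch shift described above.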