ruizhecao96 / CMGAN

Conformer-based Metric GAN for speech enhancement
MIT License
309 stars 60 forks

Question about the training sample rate and GPU training configuration #47

Closed Johnsonabuse closed 2 months ago

Johnsonabuse commented 3 months ago

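Hello, and thank you very much for your contributions to the speech enhancement field. I would like to ask what GPU configuration you used to train the model. I downloaded the VCTK-DEMAND dataset, but the training data is at 48 kHz. Was the data you used for training resampled down to 16 kHz?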

BancoLin commented 3 months ago

You need a high-end GPU with at least 24 GB of memory; otherwise you have to lower the FFT resolution or the segment duration. As for VCTK-DEMAND, yes, you need to downsample it to 16 kHz yourself.
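
For anyone landing here with the same question, below is a minimal sketch of what "downsample to 16 kHz" involves: low-pass filter the 48 kHz signal, then keep every third sample. It is not the repository's preprocessing script. The function names (`lowpass_taps`, `downsample`), the filter length, and the cutoff are all assumptions chosen for illustration, and it uses only the Python standard library so it runs anywhere. Reading and writing the actual wav files is left out. For the full dataset an established resampler (for example `scipy.signal.resample_poly`, or a command-line tool such as sox) would be faster and is the more usual choice.

```python
import math

def lowpass_taps(num_taps, cutoff):
    """Hamming-windowed sinc low-pass FIR filter.

    cutoff is a fraction of the input sample rate (0 < cutoff < 0.5).
    num_taps is assumed to be odd so the filter has a center tap.
    """
    mid = (num_taps - 1) / 2
    taps = []
    for n in range(num_taps):
        x = n - mid
        if x == 0:
            sinc = 2 * cutoff
        else:
            sinc = math.sin(2 * math.pi * cutoff * x) / (math.pi * x)
        window = 0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))
        taps.append(sinc * window)
    total = sum(taps)
    return [t / total for t in taps]  # normalize to unity gain at DC

def downsample(samples, factor=3, num_taps=121):
    """Low-pass filter, then keep every `factor`-th sample.

    factor=3 turns 48 kHz audio into 16 kHz audio.
    """
    # Assumed cutoff: slightly below the new Nyquist frequency,
    # which is 0.5 / factor of the input sample rate.
    taps = lowpass_taps(num_taps, 0.45 / factor)
    half = num_taps // 2
    out = []
    for i in range(0, len(samples), factor):
        acc = 0.0
        for k, t in enumerate(taps):
            j = i + k - half
            if 0 <= j < len(samples):  # treat samples outside the signal as zero
                acc += t * samples[j]
        out.append(acc)
    return out

# 0.1 s of a 1 kHz tone sampled at 48 kHz
tone = [math.sin(2 * math.pi * 1000 * n / 48000) for n in range(4800)]
resampled = downsample(tone)
print(len(resampled))  # 1600 samples, i.e. 0.1 s at 16 kHz
```

The filtering step matters: simply taking every third sample without it would fold everything above 8 kHz back into the 16 kHz signal as aliasing.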