sp-uhh / sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
MIT License
454 stars, 69 forks

Wrong output shape in enhancement.py #47

Closed: PavelPanjaya closed this issue 4 months ago

PavelPanjaya commented 4 months ago

The output tensor x_hat has shape torch.Size([2, 758949]), which can't be written as an audio file because of the first dimension.

cobalamin commented 4 months ago

The model is designed for single-channel speech enhancement, and a first dimension of 2 suggests two channels. Could you be providing a stereo input file?

PavelPanjaya commented 4 months ago

Thanks for your response. I converted my audio to mono and it works now.
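
For anyone hitting the same shape, here is a minimal sketch of one way to downmix a stereo file before running enhancement. It is not part of sgmse: it uses only the Python standard library, assumes a 16-bit PCM WAV input, and the function name `downmix_to_mono` and the file paths are hypothetical.

```python
import struct
import wave


def downmix_to_mono(src_path, dst_path):
    """Average all channels of a 16-bit PCM WAV file into a single channel.

    Hypothetical helper for illustration; returns the number of frames written.
    """
    with wave.open(src_path, "rb") as src:
        n_channels = src.getnchannels()
        sample_width = src.getsampwidth()
        frame_rate = src.getframerate()
        raw = src.readframes(src.getnframes())

    if sample_width != 2:
        raise ValueError("this sketch only handles 16-bit PCM WAV files")

    # WAV stores samples interleaved (L, R, L, R, ...) as little-endian int16.
    samples = struct.unpack("<%dh" % (len(raw) // 2), raw)

    # Average each frame's channels into one sample (integer floor division).
    mono = [
        sum(samples[i:i + n_channels]) // n_channels
        for i in range(0, len(samples), n_channels)
    ]

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)           # single channel, as the model expects
        dst.setsampwidth(2)           # keep 16-bit samples
        dst.setframerate(frame_rate)  # keep the original sample rate
        dst.writeframes(struct.pack("<%dh" % len(mono), *mono))
    return len(mono)
```

Calling `downmix_to_mono("noisy_stereo.wav", "noisy_mono.wav")` and pointing the enhancement script at the mono file should give a single-channel output. If the audio is already loaded as a `[channels, samples]` tensor, averaging over the first dimension (for example `y.mean(dim=0, keepdim=True)` in PyTorch) is the equivalent step.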

cobalamin commented 4 months ago

Great! Closing this issue.