OutOfMemoryError: CUDA out of memory. Tried to allocate 15.25 GiB. GPU 0 has a total capacty of 23.69 GiB of which 5.50 GiB is free.

risqaliyevds commented 11 months ago

I have 24 GB GPU but I get this error. I try to use enhance my own audio with yours best checkpoints what is problem or it needs more gpu?

OutOfMemoryError: CUDA out of memory. Tried to allocate 15.25 GiB. GPU 0 has a total capacty of 23.69 GiB of which 5.50 GiB is free. Process 2674591 has 16.90 GiB memory in use. Including non-PyTorch memory, this process has 1.26 GiB memory in use.

yxlu-0102 commented 11 months ago

I used two 24 GB GPUs to train the model, so you just used one or somethings ?

risqaliyevds commented 10 months ago

I solved this problem. I have another question can I train a 48k rate audio model? If yes what I should change or it is impossible? I changed config.json file (to 48k) and trained my own model (get checkpoint) but it wasn't different from your best checkpoint.

yxlu-0102 commented 10 months ago

Our model was trained on a corpus with a 16 kHz sampling rate, where the original data was downsampled from 48 kHz to 16 kHz. It is feasible to train our model with a 48 kHz noisy dataset. You just need to change the sampling_rate in the config.json to 48000.

QQQQQQQQY commented 6 months ago

Hi, this has happened to me too, how did you solve it

yxlu-0102 commented 6 months ago

When did you encounter this issue? in training or inference?

QQQQQQQQY commented 6 months ago

Hi, I'm happy to hear from you, I came across it in training

risqaliyevds commented 6 months ago

Hi what is your audio sample rate?

QQQQQQQQY commented 6 months ago

16 kHz

yxlu-0102 commented 6 months ago

How many batches did you set on each GPU?

risqaliyevds commented 6 months ago

I cant remember you can start from 1.

пн, 22 апр. 2024 г., 12:54 Rory Lu @.***>:

QQQQQQQQY commented 6 months ago

Thank you very much for your reply, I will try

Johnsonabuse commented 4 months ago

作者您好请问您用什么型号什么数量的GPU训练的？我用的两张3090的训练您的模型每一个Epoch大概要1个小时 batchsize的参数设置为多少呢？谢谢

yxlu-0102 commented 4 months ago

我用的也是两张 3090，batchsize 设的还是 4

vkeep commented 3 months ago

I use default config, bs =4, segment size = 32000, 2*4090 24G gpu, also OOM.

will the result degrade when set the bs =2?

Johnsonabuse commented 3 months ago

Yes I set batchsize to 2 when I Use 2×3090

发自我的iPhone

------------------ Original ------------------ From: vkeep @.> Date: Tue,Jul 23,2024 3:11 PM To: yxlu-0102/MP-SENet @.> Cc: Johnsonabuse @.>, Comment @.> Subject: Re: [yxlu-0102/MP-SENet] OutOfMemoryError: CUDA out of memory. Triedto allocate 15.25 GiB. GPU 0 has a total capacty of 23.69 GiB of which 5.50GiB is free. (Issue #13)

作者您好请问您用什么型号什么数量的GPU训练的？我用的两张3090的训练您的模型每一个Epoch大概要1个小时 batchsize的参数设置为多少呢？谢谢

我用的也是两张 3090，batchsize 设的还是 4

I use default config, bs =4, segment size = 32000, 2*4090 24G gpu, also OOM.

will the result degrade when set the bs =2?

— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you commented.Message ID: @.***>

Lijingze666 commented 2 months ago

Hello, I encountered this issue during the inference process. How can I resolve it?

yxlu-0102 / MP-SENet

OutOfMemoryError: CUDA out of memory. Tried to allocate 15.25 GiB. GPU 0 has a total capacty of 23.69 GiB of which 5.50 GiB is free. #13