WoBuChiTang closed this issue 7 months ago
use AdamW with smaller learning rate
see finetuning section in readme
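A minimal sketch of that suggestion, assuming a standard PyTorch training setup; `model` and the exact learning rate here are placeholders, not values from this repo or its readme:

```python
import torch

model = torch.nn.Linear(512, 512)  # stand-in for the actual model

# AdamW with a reduced learning rate and default weight decay; if the loss
# still diverges, try lowering the learning rate further (e.g. 1e-5).
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5, weight_decay=0.01)
```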
Thank you!
Hi @WoBuChiTang! Have you fixed that error? Did the suggestions from Jason (reducing the learning rate and using AdamW) help? If so, could you please share the hyperparameters you used?
Hello, I am training on Chinese data with the 830M weights. At the beginning the loss is normal, but after 2000 steps the loss becomes NaN on a large scale. Did you encounter this when training on GigaSpeech?
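One common mitigation sketch for this kind of mid-training NaN (a generic PyTorch pattern, not from this repo): skip the optimizer step when the loss is non-finite and clip gradients so a single bad batch cannot blow up the weights.

```python
import torch

def training_step(model, optimizer, loss_fn, batch, targets):
    optimizer.zero_grad()
    loss = loss_fn(model(batch), targets)
    if not torch.isfinite(loss):
        # Skip this batch instead of propagating NaN into the weights.
        return None
    loss.backward()
    # Clip gradients to limit blow-ups from outlier batches.
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
    optimizer.step()
    return loss.item()
```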