Open a897456 opened 6 months ago
Hi,@lucidrains
I trained it using Part 3 of Usage, which will take 100k steps, and 1k steps per epoch, so complete the training will use100 epochs. It should be able to generate 100 .flac
files and 100 .pt
files. At present, I have listened to the 51st generated .flac
file and felt that it was white noise. What's going on, please?
Hi @lucidrains Logically, when epoch=50, I should produce an audio file that doesn't sound like white noise, right? but, so far, the output of two files sound like white noise, do you know how to solve it? Please,THS
HI @lucidrains Does this mean that only two sets of batch are involved in the loss calculation at each step?
In Usage:
loss = diffusion(raw_audio)
loss.backward()
Thank you for your work, very nice! And I'm sorry, as a newbie, I have to ask two stupid questions: