facebookresearch / AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec
Other
400 stars 21 forks source link

[tensorboard] Show? #33

Closed a897456 closed 1 month ago

a897456 commented 1 month ago

Hi @bigpon Can you show the training process graph of 'symAD_libritts_24000_hop300' ? It is implemented through the tensorboard command. Because I found that my training graph is chaotic, as shown in the following figure. image

bigpon commented 1 month ago

Hi, I don't preserve the results. However, your results seem reasonable. The mel loss will become higher in the GAN training stage.

a897456 commented 1 month ago

However, your results seem reasonable.

  1. I found that the loss curve of the generator is relatively smooth, but the loss curve of the discriminator fluctuates greatly. I think there may be a problem, so I want to see if your results are the same.
  2. I think there is a second problem with this graph, that is, neither the loss curve of the generator nor the loss curve of the discriminator has tended to be constant. Should the step size be increased and the training continued until the curves tend to be stable?
bigpon commented 1 month ago

Both are normal. GAN training is not stable since we alternatively update the generator and discriminator.

a897456 commented 1 month ago
  1. I think there is a second problem with this graph, that is, neither the loss curve of the generator nor the loss curve of the discriminator has tended to be constant. Should the step size be increased and the training continued until the curves tend to be stable?

Do you think it is necessary to continue increasing the step size for training in this issue?

bigpon commented 1 month ago

You can try it. I didn't find significant perceptual differences for training longer.