Hello, during the training process, what is the typical value of the loss? Does it converge quickly? Currently, I'm facing a situation where it doesn't converge. #1
Hello, during the training process, what is the typical value of the loss? Does it converge quickly? Currently, I'm facing a situation where it doesn't converge.
Hello, during the training process, what is the typical value of the loss? Does it converge quickly? Currently, I'm facing a situation where it doesn't converge.