Open huytuong010101 opened 2 years ago
Since it uses the SGD optimizer, do you think the batch size does not affect the loss? I'm just using a batch size of 4.
I think the most critical problem is the lack of training data. I recommend collecting more. Also, why don't you add momentum to SGD?
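To make the momentum suggestion concrete, here is a minimal sketch of a single SGD-with-momentum update in plain Python. The function name `sgd_momentum_step` and the toy objective are made up for illustration and are not taken from the model's code. The update rule follows the form Keras documents for `tf.keras.optimizers.SGD` when `momentum` is greater than 0, and the values `lr=0.1` and `momentum=0.9` are assumptions, not tuned settings.

```python
def sgd_momentum_step(w, velocity, grad, lr=0.01, momentum=0.9):
    """One SGD-with-momentum update for a single scalar weight (hypothetical helper)."""
    # The velocity keeps a decaying sum of past gradient steps.
    velocity = momentum * velocity - lr * grad
    # The weight moves along the velocity, not along the raw gradient.
    w = w + velocity
    return w, velocity


# Toy usage: a few steps on f(w) = w**2, whose gradient is 2 * w.
w, v = 1.0, 0.0
for _ in range(3):
    w, v = sgd_momentum_step(w, v, grad=2.0 * w, lr=0.1, momentum=0.9)
```

With `momentum=0.0` this reduces to plain SGD, so it is easy to compare the two settings on the same data.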
@huytuong010101 Here's the plot 😀
Wow thank you so muchhhhhhhhh <3
Hi @CauchyComplete again 💯 I am implementing your model with tensorflow. I tried training it on CASIAv2/Tp (without pretraining) for 6 epochs, but the loss decreases very slowly and has not decreased at all since epoch 5. Can you share the plot of your loss without pretraining?