Hello everyone, I have run the cifar10.py (set batch_size=128 as in the paper, and set data_augmentation=false, using sgd). My result shows that the training accuracy converges to 1 after 30 epochs, but the testing accuracy converges to 0.6. I do not understand why training accuracy converges so fast (much faster than the results in paper). Does anyone has an idea of the reason?
Hello everyone, I have run the cifar10.py (set batch_size=128 as in the paper, and set data_augmentation=false, using sgd). My result shows that the training accuracy converges to 1 after 30 epochs, but the testing accuracy converges to 0.6. I do not understand why training accuracy converges so fast (much faster than the results in paper). Does anyone has an idea of the reason?
Thank you!