Closed basaltzhang closed 6 years ago
Hi @basaltzhang . I am still working on the implementation. From my experience that occurs when the learning rate is too high, leading to the classifier predicting the same output for all images.
With Adam optimizer I was able to get >80% accuracy, but with SGD+momentum as referred in the paper I am still trying to reproduce the results.
When I end the implementation I will update it in the Readme.
Despite that feel free to reach out if you have any doubts or if you find any bug!
I added some early stop conditions to avoid this issue, so i'll close this issue.
Fell free to open it if you find it necessary!
If you find another bug, do not hesitate to open another issue!
I changed the max train iteration to 18000, and after the training process the evaluation result shows below:
Is that the same with you?