Thank you for sharing your code.
I'm interested in your work ang trying to replicate your CIFAR10 results.
However I have two issues.
I only got 61% when I run TrainCifar10_S.py without loading weights and run TestCifar10_S.py, though pre-trained weights shows 64% when I run TestCifar10_S.py. Could you let me know how to generate your pre-trained weights from scratch?
In your paper, you wrote "Therefore, we limited the maximum allowed row-normalized Frobenius norm of the gradient of each weight matrix to 10." at page 5. Could you let me know which part of the code implement this feature.
Thank you for sharing your code. I'm interested in your work ang trying to replicate your CIFAR10 results. However I have two issues.
I only got 61% when I run TrainCifar10_S.py without loading weights and run TestCifar10_S.py, though pre-trained weights shows 64% when I run TestCifar10_S.py. Could you let me know how to generate your pre-trained weights from scratch?
In your paper, you wrote "Therefore, we limited the maximum allowed row-normalized Frobenius norm of the gradient of each weight matrix to 10." at page 5. Could you let me know which part of the code implement this feature.