No convergence - Training AdderNet with Cifar10

AI-Huang / AdderNet-tf

TensorFlow implementation for paper "AdderNet: Do We Really Need Multiplications in Deep Learning?"

BSD 3-Clause "New" or "Revised" License

5 stars 0 forks source link

Hi Kan,

Thank you for your brilliant work for writing AdderNet in Tensorflow.

I learned from your codes that you have trained ResNet20-V1 and achieved 92.16% accuracy in Cifar10.

I tried to train the model with the following command, but found that it did not converge at all. The training loss and accuracy at 100 epoch are same as that of the first 5 epoch, whose screenshots are attached below.

python train_addernet_cifar10.py --data_preprocessing "subtract_pixel_mean" --lr_schedule "cifar10_scheduler" --dataset "cifar10" --use_addernet --data_preprocessing subtract_pixel_mean --epochs 300 --batch_size 64

I would be appreciate it if you could let me know what changes I should make.

Best Regard, Shawn

Epoch 1-5: 截屏2022-12-15 00 32 48

Epoch 100:

截屏2022-12-15 00 36 44

AI-Huang / AdderNet-tf

No convergence - Training AdderNet with Cifar10 #1