alexrame / mixmo-pytorch

Official Pytorch implementation of MixMo framework
Other
84 stars 16 forks source link

Training speed of mixmo is very slow #1

Closed chenshen03 closed 3 years ago

chenshen03 commented 3 years ago

I followed your instructions to train mixmo on cifar-100. However, the training is very slow compared to the baseline. Can you tell me how much time it takes to train mixmo on CIFAR-100 in your experiments?

$ python3 scripts/train.py --config_path config/cifar100/exp_cifar100_wrn2810-2_cutmixmo-p5_msdacutmix_bar4.yaml --dataplace $dataplace --saveplace $saveplace
alexrame commented 3 years ago

Thank you for your interest. Indeed MixMo performs best with batch repetition (first introduced in MIMO https://arxiv.org/pdf/2010.06610.pdf). This option file has batch repetition set to 4, therefore each image is duplicated 4 times in a batch: overall, each epoch is 4 times longer. Note that we compare different approaches at equal training time in Section 4.3.1 from our paper (https://arxiv.org/pdf/2103.06132.pdf) and that most concurrent approaches also lengthen training. I hope this answers your question.

chenshen03 commented 3 years ago

Thank you very much. Becides, I ran the experiment exp_cifar100_wrn2810-2_cutmixmo-p5_msdacutmix_bar4.yaml for several days and the best accuracy in epoch 158 is currently 92.03%, which still well below the 95.77% reported in the paper. Can you share your training log? Thank you very much!

########################################
epoch                       158
general/classifier_lr       0.00125
general/l2_reg              3.795e+03
test/accuracy               82.03%
test/accuracy_0             78.45%
test/accuracy_1             79.68%
test/accuracy_mean          79.06%
test/accuracytop5           96.26%
test/auc                    0.894054
test/brier                  0.00250124
test/ce0                    0.79515
test/ce1                    0.73879
test/diversity_ratioerrors  1.0451
test/ece                    0.0234672
test/main                   2.6724
test/nll                    0.626094
train/ce0                   1.2476
train/ce1                   1.2436
train/main                  3.6286
########################################
alexrame commented 3 years ago

That's totally normal to have around 82% at epoch 158. Accuracies are sharply increasing at each learning rate update, occurring at epochs100, 200, and 225.
Here I provide the logs for the best performing run that reaches 85.92 at epoch 273.

epoch,test/accuracy,test/accuracytop5,test/accuracy_mean,test/accuracy_0,test/accuracy_1,train/accuracy,checkpoint_saved 1,9.12,29.13,08.745,07.98,09.51,4.31,0.0 2,14.41,38.89,12.88,11.61,14.15,8.94,0.0 3,22.47,53.19,20.795,20.15,21.44,13.14,0.0 4,30.08,62.93,28.05,25.87,30.23,16.60,0.0 5,32.88,65.62,30.535,30.46,30.61,20.15,0.0 6,36.93,72.25,35.065,35.31,34.82,22.66,0.0 7,40.23,73.99,36.46,36.25,36.67,25.23,0.0 8,43.39,76.11,37.76,40.01,35.51,27.41,0.0 9,48.08,80.32,42.775,43.02,42.53,29.27,0.0 10,50.56,81.63,46.36,45.85,46.87,31.07,0.0 11,50.11,82.21,46.095,44.2,47.99,32.32,0.0 12,51.43,83.04,47.915,48.01,47.82,33.76,0.0 13,54.25,84.54,49.935,51.5,48.37,34.48,0.0 14,51.91,83.49,47.755,48.38,47.13,36.10,0.0 15,53.29,85.46,50.115,50.48,49.75,36.75,1.0 16,54.01,84.61,50.2,49.63,50.77,37.57,0.0 17,56.67,86.04,52.26,52.88,51.64,38.27,0.0 18,60.02,88.17,56.23,54.39,58.07,38.55,0.0 19,56.17,85.12,51.53,53.79,49.27,39.47,0.0 20,59.80,86.84,55.51,54.68,56.34,39.65,0.0 21,59.96,87.79,54.535,54.8,54.27,40.29,0.0 22,58.71,86.23,54.92,53.85,55.99,40.80,0.0 23,59.92,87.97,55.815,54.91,56.72,41.30,0.0 24,61.14,88.59,57.5,57.41,57.59,41.55,0.0 25,61.60,88.22,57.095,57.09,57.1,41.62,0.0 26,62.69,89.23,58.6,58.31,58.89,42.15,0.0 27,62.95,89.73,58.555,56.36,60.75,42.34,0.0 28,59.57,88.20,56.06,55.41,56.71,42.54,0.0 29,62.29,89.05,58.57,59.04,58.1,42.93,0.0 30,63.07,89.51,58.72,58.66,58.78,43.24,1.0 31,60.87,87.71,57.31,58.11,56.51,43.56,0.0 32,61.63,88.58,56.3,54.07,58.53,43.69,0.0 33,63.81,89.75,60.41,60.81,60.01,43.84,0.0 34,63.12,89.11,58.34,58.17,58.51,43.94,0.0 35,59.36,87.13,56.31,55.63,56.99,43.73,0.0 36,63.57,89.58,58.84,57.69,59.99,44.07,0.0 37,62.90,89.27,59.145,59.6,58.69,44.41,0.0 38,62.71,89.36,57.17,59.56,54.78,44.62,0.0 39,64.47,90.19,60.32,61.34,59.3,44.68,0.0 40,65.94,90.85,61.4,61.05,61.75,45.03,0.0 41,65.18,90.25,61.01,61.59,60.43,45.01,0.0 42,63.57,89.94,59.46,60.81,58.11,45.00,0.0 43,63.28,89.62,60.365,60.83,59.9,44.95,0.0 44,62.12,87.90,57.55,58.87,56.23,44.95,0.0 45,65.06,90.75,60.695,60.25,61.14,45.30,1.0 46,64.92,90.48,59.815,58.62,61.01,45.47,0.0 47,65.55,90.78,60.375,60.0,60.75,45.32,0.0 48,63.47,89.87,58.75,57.22,60.28,45.93,0.0 49,65.52,89.60,60.77,58.25,63.29,45.74,0.0 50,64.31,90.25,60.585,61.49,59.68,45.85,0.0 51,65.29,90.28,60.59,60.02,61.16,45.69,0.0 52,64.80,90.35,61.17,61.86,60.48,45.92,0.0 53,65.67,90.53,61.335,60.55,62.12,46.08,0.0 54,67.10,91.29,61.76,61.71,61.81,46.44,0.0 55,64.39,89.20,58.44,54.73,62.15,46.29,0.0 56,66.71,91.11,62.78,61.25,64.31,46.27,0.0 57,66.45,91.44,62.445,61.48,63.41,46.40,0.0 58,67.12,91.62,62.8,62.97,62.63,46.48,0.0 59,64.23,90.77,60.3,58.66,61.94,46.81,0.0 60,67.20,91.39,62.865,63.48,62.25,46.37,1.0 61,67.76,91.96,63.825,63.22,64.43,46.84,0.0 62,66.24,91.60,62.14,61.06,63.22,46.60,0.0 63,66.46,90.63,62.955,63.17,62.74,46.47,0.0 64,66.90,90.73,62.9,63.14,62.66,46.67,0.0 65,66.80,91.07,62.975,64.3,61.65,47.34,0.0 66,68.03,91.86,63.615,65.14,62.09,46.61,0.0 67,68.01,91.83,63.955,63.9,64.01,47.01,0.0 68,66.04,90.56,61.555,61.94,61.17,47.20,0.0 69,67.59,91.01,63.335,62.71,63.96,47.09,0.0 70,66.31,90.50,62.125,61.66,62.59,47.23,0.0 71,66.92,90.88,62.605,62.62,62.59,47.14,0.0 72,64.25,90.50,60.645,59.39,61.9,47.41,0.0 73,68.19,91.65,63.84,63.36,64.32,47.01,0.0 74,67.04,91.30,62.985,62.8,63.17,47.32,0.0 75,67.46,90.59,62.07,63.72,60.42,47.46,1.0 76,66.76,91.30,62.815,64.02,61.61,47.22,0.0 77,67.92,92.29,63.555,63.1,64.01,47.06,0.0 78,66.89,91.07,62.075,62.06,62.09,47.62,0.0 79,68.39,91.31,63.635,63.76,63.51,47.65,0.0 80,66.62,91.13,61.745,62.24,61.25,47.64,0.0 81,65.99,90.09,60.955,58.77,63.14,47.72,0.0 82,67.42,91.22,61.89,62.42,61.36,47.77,0.0 83,68.13,91.94,63.915,64.06,63.77,47.84,0.0 84,66.89,91.12,62.45,60.83,64.07,47.55,0.0 85,67.56,91.15,63.235,64.93,61.54,47.84,0.0 86,66.99,91.12,62.405,62.2,62.61,47.69,0.0 87,68.81,92.12,63.775,64.77,62.78,47.84,0.0 88,64.46,89.98,60.025,61.68,58.37,47.96,0.0 89,66.35,90.81,62.27,63.25,61.29,47.78,0.0 90,65.25,90.77,60.445,57.25,63.64,48.03,1.0 91,66.90,90.87,62.23,64.34,60.12,47.82,0.0 92,65.18,90.52,61.,003.4,61.66,47.63,0.0 93,65.92,90.82,62.07,62.39,61.75,47.99,0.0 94,68.22,92.07,63.385,62.38,64.39,47.63,0.0 95,66.10,91.15,61.32,60.5,62.14,47.82,0.0 96,67.87,90.67,64.12,63.29,64.95,47.86,0.0 97,67.17,91.33,62.935,63.97,61.9,47.95,0.0 98,67.26,90.36,62.54,63.06,62.02,48.13,0.0 99,68.97,91.59,64.42,65.62,63.22,47.75,0.0 100,65.10,90.20,61.74,62.75,60.73,47.99,0.0 101,79.10,95.88,77.37,77.18,77.56,59.90,0.0 102,80.07,96.53,78.71,78.65,78.77,62.74,0.0 103,80.65,96.46,79.065,79.24,78.89,63.99,0.0 104,80.36,96.54,78.915,78.92,78.91,64.84,0.0 105,81.46,96.67,79.79,79.76,79.82,65.56,1.0 106,81.56,96.81,79.84,79.93,79.75,66.12,0.0 107,81.91,96.87,80.275,80.57,79.98,66.70,0.0 108,81.85,96.96,80.14,80.22,80.06,67.08,0.0 109,81.72,96.90,79.995,80.24,79.75,67.52,0.0 110,81.77,96.86,80.145,79.89,80.4,68.30,0.0 111,82.15,96.87,80.19,80.19,80.19,68.23,0.0 112,81.83,96.65,80.125,80.38,79.87,68.30,0.0 113,82.52,96.74,80.215,80.28,80.15,68.83,0.0 114,82.15,96.70,80.025,79.93,80.12,69.03,0.0 115,81.81,96.64,80.11,80.07,80.15,68.96,0.0 116,82.30,96.88,80.22,80.35,80.09,68.90,0.0 117,82.43,96.92,80.245,80.25,80.24,69.78,0.0 118,82.28,96.74,80.37,80.82,79.92,69.48,0.0 119,82.53,96.90,80.45,80.68,80.22,70.12,0.0 120,81.88,96.67,79.66,79.7,79.62,69.69,1.0 121,81.87,96.61,79.45,79.27,79.63,70.21,0.0 122,81.77,96.69,79.65,79.59,79.71,70.05,0.0 123,82.21,96.79,80.05,80.22,79.88,69.80,0.0 124,81.77,96.66,78.98,78.74,79.22,70.16,0.0 125,81.86,96.86,79.505,79.55,79.46,70.79,0.0 126,82.47,96.66,80.185,79.97,80.4,70.25,0.0 127,81.86,96.64,79.505,79.64,79.37,70.34,0.0 128,81.86,96.54,79.475,79.4,79.55,70.59,0.0 129,81.95,96.70,79.405,79.16,79.65,70.46,0.0 130,82.17,96.67,79.695,79.6,79.79,70.89,0.0 131,81.63,96.66,79.05,79.31,78.79,70.86,0.0 132,81.92,96.70,78.89,78.99,78.79,71.00,0.0 133,82.31,96.56,79.725,79.69,79.76,70.70,0.0 134,82.05,96.47,79.51,79.38,79.64,70.63,0.0 135,81.57,96.39,79.23,79.13,79.33,71.36,1.0 136,82.13,96.77,79.425,79.7,79.15,71.25,0.0 137,82.48,96.62,79.685,79.51,79.86,71.21,0.0 138,82.16,96.73,79.57,80.11,79.03,71.26,0.0 139,81.66,96.24,79.145,79.18,79.11,71.55,0.0 140,82.66,96.79,80.075,80.39,79.76,71.48,0.0 141,81.95,96.40,79.63,79.27,79.99,71.43,0.0 142,82.46,96.63,79.73,79.62,79.84,71.43,0.0 143,82.03,96.35,79.08,79.2,78.96,71.58,0.0 144,81.90,96.72,79.265,79.59,78.94,71.70,0.0 145,81.24,96.44,78.32,78.97,77.67,71.83,0.0 146,81.65,96.20,78.85,78.74,78.96,71.74,0.0 147,82.18,96.61,79.46,79.52,79.4,71.76,0.0 148,81.85,96.52,78.97,79.22,78.72,71.78,0.0 149,82.49,96.58,79.605,79.77,79.44,71.87,0.0 150,82.17,96.55,79.58,79.67,79.49,71.74,1.0 151,82.69,96.58,80.02,80.05,79.99,71.94,0.0 152,81.93,96.34,79.11,79.1,79.12,72.43,0.0 153,82.00,96.42,78.96,78.61,79.31,72.34,0.0 154,82.15,96.67,79.64,79.29,79.99,72.38,0.0 155,81.31,96.62,78.72,78.84,78.6,72.67,0.0 156,81.94,96.38,79.21,79.31,79.11,71.97,0.0 157,81.84,96.47,79.11,78.79,79.43,72.51,0.0 158,82.10,96.53,79.455,79.74,79.17,72.15,0.0 159,81.86,96.46,79.315,79.46,79.17,72.35,0.0 160,81.93,96.56,79.055,79.02,79.09,72.78,0.0 161,81.20,96.13,78.81,79.29,78.33,72.61,0.0 162,81.29,96.27,78.61,78.16,79.06,72.56,0.0 163,81.58,96.21,79.025,78.64,79.41,72.74,0.0 164,81.54,96.16,78.86,79.32,78.4,72.74,0.0 165,81.84,96.43,79.11,79.27,78.95,72.97,1.0 166,81.52,96.38,78.835,79.02,78.65,72.82,0.0 167,82.19,96.43,79.34,79.43,79.25,72.65,0.0 168,82.26,96.38,79.35,79.43,79.27,72.71,0.0 169,81.19,96.34,78.175,78.6,77.75,73.10,0.0 170,81.81,96.49,78.815,79.07,78.56,72.92,0.0 171,82.41,96.63,79.765,79.74,79.79,73.05,0.0 172,82.01,96.27,78.76,78.05,79.47,73.07,0.0 173,81.75,96.48,78.935,79.17,78.7,72.68,0.0 174,81.72,96.45,79.115,79.59,78.64,73.22,0.0 175,81.91,96.48,79.4,79.42,79.38,73.72,0.0 176,82.01,96.69,79.275,79.32,79.23,73.40,0.0 177,81.98,96.43,79.035,79.18,78.89,73.49,0.0 178,82.47,96.44,79.79,79.39,80.19,73.02,0.0 179,82.16,96.25,79.265,79.67,78.86,73.22,0.0 180,82.11,96.50,79.58,79.83,79.33,73.54,1.0 181,81.97,96.41,78.85,78.67,79.03,73.73,0.0 182,81.57,96.27,79.01,78.77,79.25,73.01,0.0 183,81.80,96.62,78.865,78.56,79.17,73.48,0.0 184,81.53,96.51,78.83,78.73,78.93,73.27,0.0 185,81.94,96.28,79.1,79.12,79.08,73.53,0.0 186,83.01,96.38,80.245,80.25,80.24,73.49,1.0 187,80.86,96.06,77.705,78.39,77.02,73.53,0.0 187,82.41,96.41,79.715,79.84,79.59,73.93,1.0 188,81.76,96.54,78.81,78.85,78.77,73.76,0.0 189,82.39,96.48,79.405,79.34,79.47,73.36,0.0 190,81.75,96.25,78.745,78.77,78.72,73.97,0.0 191,81.96,96.33,78.87,79.16,78.58,73.59,0.0 192,81.43,96.29,78.305,77.56,79.05,73.63,0.0 193,81.79,96.18,78.985,79.09,78.88,73.25,0.0 194,82.15,96.44,79.19,79.74,78.64,74.06,0.0 195,81.60,96.30,78.74,79.11,78.37,73.79,1.0 196,82.20,96.29,79.06,79.,091.2,74.03,0.0 197,81.87,96.30,79.15,79.47,78.83,74.02,0.0 198,82.25,96.34,79.035,79.26,78.81,73.86,0.0 199,82.12,96.60,79.22,78.77,79.67,73.79,0.0 200,81.95,96.43,79.365,79.55,79.18,73.37,0.0 201,84.75,97.04,82.79,82.89,82.69,78.14,1.0 202,84.89,97.06,82.975,83.3,82.65,79.53,1.0 203,85.18,97.18,83.095,83.34,82.85,79.61,1.0 204,85.15,97.23,83.25,83.68,82.82,80.14,0.0 205,85.01,97.26,83.23,83.32,83.14,80.38,0.0 206,85.11,97.30,83.245,83.46,83.03,80.88,0.0 207,85.01,97.24,83.42,83.87,82.97,80.95,0.0 208,85.09,97.13,83.49,83.62,83.36,80.67,0.0 209,85.36,97.29,83.555,83.84,83.27,80.99,1.0 210,85.30,97.33,83.495,83.64,83.35,80.75,1.0 211,85.34,97.32,83.4,83.57,83.23,81.07,0.0 212,85.33,97.30,83.52,83.67,83.37,81.07,0.0 213,85.38,97.26,83.68,83.8,83.56,81.46,1.0 214,85.56,97.35,83.715,83.84,83.59,81.52,1.0 215,85.49,97.27,83.73,83.86,83.6,81.49,0.0 216,85.41,97.33,83.635,83.79,83.48,81.62,0.0 217,85.42,97.41,83.54,83.58,83.5,81.49,0.0 218,85.34,97.28,83.255,83.38,83.13,81.64,0.0 219,85.51,97.19,83.8,84.01,83.59,81.95,0.0 220,85.34,97.28,83.675,83.91,83.44,81.84,0.0 221,85.43,97.32,83.8,84.03,83.57,82.01,0.0 222,85.31,97.42,83.51,83.79,83.23,81.56,0.0 223,85.42,97.35,83.62,83.83,83.41,81.70,0.0 224,85.32,97.28,83.48,83.67,83.29,81.99,0.0 225,85.52,97.25,83.8,83.83,83.77,82.13,1.0 226,85.60,97.25,83.62,83.86,83.38,82.58,1.0 227,85.51,97.28,83.95,84.18,83.72,82.29,0.0 228,85.60,97.40,83.875,84.12,83.63,82.64,1.0 229,85.72,97.40,83.955,84.09,83.82,82.52,1.0 230,85.67,97.38,83.89,84.12,83.66,82.51,0.0 231,85.52,97.30,83.65,83.81,83.49,82.64,0.0 232,85.76,97.48,84.02,84.25,83.79,82.79,1.0 233,85.39,97.40,83.8,83.97,83.63,82.57,0.0 234,85.57,97.38,83.835,84.07,83.6,82.47,0.0 235,85.48,97.26,83.89,84.21,83.57,82.55,0.0 236,85.73,97.29,83.825,83.97,83.68,83.15,0.0 237,85.65,97.38,83.735,83.95,83.52,82.77,0.0 238,85.69,97.35,84.065,84.24,83.89,82.48,0.0 239,85.78,97.36,83.835,84.01,83.66,82.64,1.0 240,85.65,97.44,83.95,84.2,83.7,82.99,1.0 241,85.50,97.34,83.68,83.86,83.5,82.71,0.0 242,85.58,97.32,83.95,84.17,83.73,82.77,0.0 243,85.62,97.37,83.92,84.06,83.78,82.60,0.0 244,85.79,97.38,84.02,84.17,83.87,82.73,1.0 245,85.75,97.30,83.945,84.17,83.72,82.94,0.0 246,85.69,97.40,84.04,84.19,83.89,82.47,0.0 247,85.75,97.33,84.04,84.33,83.75,82.55,0.0 248,85.72,97.34,83.995,84.28,83.71,82.56,0.0 249,85.56,97.31,84.075,84.37,83.78,82.73,0.0 250,85.62,97.22,83.7,83.84,83.56,82.67,0.0 251,85.70,97.34,84.0,84.18,83.82,82.64,0.0 252,85.53,97.32,83.88,84.13,83.63,82.75,0.0 253,85.60,97.42,83.935,84.11,83.76,83.00,0.0 254,85.67,97.32,83.98,84.18,83.78,82.69,0.0 255,85.60,97.30,83.83,84.03,83.63,82.57,1.0 256,85.64,97.35,83.97,84.21,83.73,82.74,0.0 257,85.73,97.31,83.85,84.15,83.55,83.08,0.0 258,85.69,97.33,83.88,84.2,83.56,82.91,0.0 259,85.62,97.37,83.925,84.12,83.73,82.86,0.0 260,85.61,97.32,83.88,84.2,83.56,82.84,0.0 261,85.70,97.30,83.945,84.2,83.69,82.87,0.0 262,85.82,97.44,84.12,84.31,83.93,82.92,1.0 263,85.72,97.26,83.82,84.09,83.55,82.46,0.0 264,85.70,97.40,84.07,84.29,83.85,83.01,0.0 265,85.58,97.33,83.86,84.19,83.53,83.12,0.0 266,85.65,97.31,83.89,84.04,83.74,83.01,0.0 267,85.63,97.30,83.69,83.86,83.52,82.85,0.0 268,85.69,97.39,84.08,84.38,83.78,82.85,0.0 269,85.63,97.39,84.1,84.33,83.87,82.92,0.0 270,85.62,97.39,83.91,84.07,83.75,82.74,1.0 271,85.69,97.37,83.785,84.08,83.49,83.20,0.0 272,85.62,97.42,83.91,84.14,83.68,82.59,0.0 273,85.92,97.36,84.085,84.29,83.88,82.97,1.0 274,85.66,97.37,83.77,84.06,83.48,83.13,0.0 275,85.75,97.38,84.085,84.23,83.94,82.84,0.0 276,85.89,97.35,83.85,84.14,83.56,83.18,0.0 277,85.80,97.39,84.01,84.11,83.91,83.33,0.0 278,85.70,97.33,84.025,84.3,83.75,83.52,0.0 279,85.71,97.33,84.1,84.39,83.81,83.86,0.0 280,85.79,97.27,84.0,84.3,83.7,84.30,0.0 281,85.82,97.37,84.03,84.28,83.78,84.29,0.0 282,85.85,97.31,84.13,84.43,83.83,84.52,0.0 283,85.76,97.38,84.09,84.33,83.85,85.12,0.0 284,85.76,97.34,84.055,84.43,83.68,85.20,0.0 285,85.71,97.30,84.02,84.25,83.79,85.27,1.0 286,85.76,97.30,84.01,84.25,83.77,85.75,0.0 287,85.74,97.27,84.075,84.28,83.87,86.15,0.0 288,85.83,97.24,84.045,84.22,83.87,85.92,0.0 289,85.71,97.40,84.205,84.28,84.13,86.27,0.0 290,85.78,97.31,84.165,84.39,83.94,86.78,0.0 291,85.78,97.31,84.05,84.29,83.81,86.95,0.0 292,85.62,97.28,83.96,84.18,83.74,87.18,0.0 293,85.59,97.34,83.97,84.14,83.8,87.49,0.0 294,85.59,97.33,84.,041.3,83.87,87.65,0.0 295,85.55,97.30,83.84,83.99,83.69,87.79,0.0 296,85.63,97.35,84.05,84.18,83.92,88.07,0.0 297,85.77,97.35,83.89,84.16,83.62,88.40,0.0 298,85.67,97.38,83.96,84.05,83.87,88.54,0.0 299,85.61,97.26,83.9,84.05,83.75,88.85,0.0 300,85.63,97.25,83.825,83.98,83.67,89.03,1.0