yuhuan-wu / P2T

[TPAMI22] Pyramid Pooling Transformer for Scene Understanding
200 stars 18 forks source link

Training log of ImageNet #13

Closed LMMMEng closed 1 year ago

LMMMEng commented 1 year ago

It seems that the ImageNet training log of the tiny version is incomplete, could you update it?

yuhuan-wu commented 1 year ago

There seems nothing wrong with ImagNet training log of P2T-Tiny. For convenience, I paste the log from our shared folder as below

{"train_lr": 0.0004998164668624689, "train_loss": 3.5743932656104045, "epoch": 152, "n_parameters": 11590840}
{"train_lr": 0.0004946335021577291, "train_loss": 3.5612054719722908, "epoch": 153, "n_parameters": 11590840}
{"train_lr": 0.0004894516742563275, "train_loss": 3.56016564652693, "epoch": 154, "n_parameters": 11590840}
{"train_lr": 0.0004842715514040588, "train_loss": 3.5652903384275194, "epoch": 155, "n_parameters": 11590840}
{"train_lr": 0.00047909370165973703, "train_loss": 3.5613126419097494, "epoch": 156, "n_parameters": 11590840}
{"train_lr": 0.00047391869283298395, "train_loss": 3.555761627132277, "epoch": 157, "n_parameters": 11590840}
{"train_lr": 0.0004687470924218306, "train_loss": 3.554628330883648, "epoch": 158, "n_parameters": 11590840}
{"train_lr": 0.0004635794675504931, "train_loss": 3.535118740906628, "epoch": 159, "n_parameters": 11590840}
{"train_lr": 0.0004584163849073357, "train_loss": 3.5404944376265117, "test_loss": 1.1421642118486866, "test_acc1": 74.286002109375, "test_acc5": 92.60000264648437, "epoch": 160, "n_parameters": 11590840}
{"train_lr": 0.0004532584106825094, "train_loss": 3.553695835226731, "epoch": 161, "n_parameters": 11590840}
{"train_lr": 0.0004481061105060409, "train_loss": 3.54094214683814, "epoch": 162, "n_parameters": 11590840}
{"train_lr": 0.00044296004938566933, "train_loss": 3.523124451176535, "epoch": 163, "n_parameters": 11590840}
{"train_lr": 0.0004378207916450146, "train_loss": 3.5339377294484375, "epoch": 164, "n_parameters": 11590840}
{"train_lr": 0.00043268890086160794, "train_loss": 3.515914784942409, "epoch": 165, "n_parameters": 11590840}
{"train_lr": 0.00042756493980507503, "train_loss": 3.517750824431626, "epoch": 166, "n_parameters": 11590840}
{"train_lr": 0.0004224494703755201, "train_loss": 3.5138288475126385, "epoch": 167, "n_parameters": 11590840}
{"train_lr": 0.00041734305354179923, "train_loss": 3.5127825907570758, "epoch": 168, "n_parameters": 11590840}
{"train_lr": 0.0004122462492800569, "train_loss": 3.5142803016207296, "epoch": 169, "n_parameters": 11590840}
{"train_lr": 0.0004071596165123325, "train_loss": 3.5071192180557693, "test_loss": 1.1510255648486916, "test_acc1": 74.8600022265625, "test_acc5": 92.81800235351562, "epoch": 170, "n_parameters": 11590840}
{"train_lr": 0.0004020837130452182, "train_loss": 3.5065351560247318, "epoch": 171, "n_parameters": 11590840}
{"train_lr": 0.00039701909550871056, "train_loss": 3.4980822681642167, "epoch": 172, "n_parameters": 11590840}
{"train_lr": 0.0003919663192952229, "train_loss": 3.4923600166035498, "epoch": 173, "n_parameters": 11590840}
{"train_lr": 0.00038692593849859734, "train_loss": 3.48667043068712, "epoch": 174, "n_parameters": 11590840}
{"train_lr": 0.0003818985058534034, "train_loss": 3.4858136311423578, "epoch": 175, "n_parameters": 11590840}
{"train_lr": 0.0003768845726742596, "train_loss": 3.4909114401808363, "epoch": 176, "n_parameters": 11590840}
{"train_lr": 0.0003718846887954506, "train_loss": 3.480566321898231, "epoch": 177, "n_parameters": 11590840}
{"train_lr": 0.00036689940251057154, "train_loss": 3.4779263569487275, "epoch": 178, "n_parameters": 11590840}
{"train_lr": 0.0003619292605124837, "train_loss": 3.471419211474063, "epoch": 179, "n_parameters": 11590840}
{"train_lr": 0.0003569748078332423, "train_loss": 3.471359407158493, "test_loss": 1.0835051923647694, "test_acc1": 75.71200217773438, "test_acc5": 93.12200203125, "epoch": 180, "n_parameters": 11590840}
{"train_lr": 0.00035203658778439235, "train_loss": 3.46441358041992, "epoch": 181, "n_parameters": 11590840}
{"train_lr": 0.0003471151418974503, "train_loss": 3.455791411687621, "epoch": 182, "n_parameters": 11590840}
{"train_lr": 0.0003422110098644085, "train_loss": 3.4613385772009453, "epoch": 183, "n_parameters": 11590840}
{"train_lr": 0.0003373247294785742, "train_loss": 3.4543474361860302, "epoch": 184, "n_parameters": 11590840}
{"train_lr": 0.0003324568365756944, "train_loss": 3.4412438851847447, "epoch": 185, "n_parameters": 11590840}
{"train_lr": 0.00032760786497508304, "train_loss": 3.4446582230542013, "epoch": 186, "n_parameters": 11590840}
{"train_lr": 0.00032277834642108216, "train_loss": 3.447267726123285, "epoch": 187, "n_parameters": 11590840}
{"train_lr": 0.00031796881052486697, "train_loss": 3.4390437261854334, "epoch": 188, "n_parameters": 11590840}
{"train_lr": 0.0003131797847062025, "train_loss": 3.437155818207849, "epoch": 189, "n_parameters": 11590840}
{"train_lr": 0.0003084117941357799, "train_loss": 3.436713810316283, "test_loss": 1.0393913918071322, "test_acc1": 75.97800221679688, "test_acc5": 93.26800233398437, "epoch": 190, "n_parameters": 11590840}
{"train_lr": 0.00030366536167747524, "train_loss": 3.423190413797788, "epoch": 191, "n_parameters": 11590840}
{"train_lr": 0.00029894100783110664, "train_loss": 3.422523183966998, "epoch": 192, "n_parameters": 11590840}
{"train_lr": 0.0002942392506752879, "train_loss": 3.4283898516977245, "epoch": 193, "n_parameters": 11590840}
{"train_lr": 0.0002895606058107011, "train_loss": 3.4096099771708133, "epoch": 194, "n_parameters": 11590840}
{"train_lr": 0.0002849055863034561, "train_loss": 3.4102377335873726, "epoch": 195, "n_parameters": 11590840}
{"train_lr": 0.0002802747026289244, "train_loss": 3.401014165412322, "epoch": 196, "n_parameters": 11590840}
{"train_lr": 0.00027566846261567343, "train_loss": 3.401992839994095, "epoch": 197, "n_parameters": 11590840}
{"train_lr": 0.00027108737138981483, "train_loss": 3.4068873604591325, "epoch": 198, "n_parameters": 11590840}
{"train_lr": 0.00026653193131964784, "train_loss": 3.3921055701567973, "epoch": 199, "n_parameters": 11590840}
{"train_lr": 0.00026200264196050673, "train_loss": 3.3936248188682026, "test_loss": 1.0303690580801033, "test_acc1": 76.59400234863281, "test_acc5": 93.540002421875, "epoch": 200, "n_parameters": 11590840}
{"train_lr": 0.00025750000000000143, "train_loss": 3.389856409016464, "epoch": 201, "n_parameters": 11590840}
{"train_lr": 0.0002530244992035622, "train_loss": 3.3846808571895535, "epoch": 202, "n_parameters": 11590840}
{"train_lr": 0.00024857663036030185, "train_loss": 3.372419533814839, "epoch": 203, "n_parameters": 11590840}
{"train_lr": 0.00024415688122914167, "train_loss": 3.3688626083776914, "epoch": 204, "n_parameters": 11590840}
{"train_lr": 0.00023976573648539732, "train_loss": 3.362280931832979, "epoch": 205, "n_parameters": 11590840}
{"train_lr": 0.0002354036776675575, "train_loss": 3.364592643116685, "epoch": 206, "n_parameters": 11590840}
{"train_lr": 0.00023107118312454297, "train_loss": 3.3697193838137802, "epoch": 207, "n_parameters": 11590840}
{"train_lr": 0.00022676872796319747, "train_loss": 3.372238050464341, "epoch": 208, "n_parameters": 11590840}
{"train_lr": 0.00022249678399621184, "train_loss": 3.3461100655398686, "epoch": 209, "n_parameters": 11590840}
{"train_lr": 0.00021825581969037202, "train_loss": 3.3540834311148724, "test_loss": 1.0081777131877183, "test_acc1": 77.00800224121093, "test_acc5": 93.76800241210937, "epoch": 210, "n_parameters": 11590840}
{"train_lr": 0.0002140463001152288, "train_loss": 3.3574369996667004, "epoch": 211, "n_parameters": 11590840}
{"train_lr": 0.00020986868689201908, "train_loss": 3.3426170350312234, "epoch": 212, "n_parameters": 11590840}
{"train_lr": 0.00020572343814312388, "train_loss": 3.3400714560259255, "epoch": 213, "n_parameters": 11590840}
{"train_lr": 0.00020161100844177222, "train_loss": 3.3358767846291015, "epoch": 214, "n_parameters": 11590840}
{"train_lr": 0.0001975318487622333, "train_loss": 3.3342676946489838, "epoch": 215, "n_parameters": 11590840}
{"train_lr": 0.0001934864064303279, "train_loss": 3.3346634693830897, "epoch": 216, "n_parameters": 11590840}
{"train_lr": 0.00018947512507439562, "train_loss": 3.3282264391485925, "epoch": 217, "n_parameters": 11590840}
{"train_lr": 0.00018549844457663907, "train_loss": 3.3229453079372666, "epoch": 218, "n_parameters": 11590840}
{"train_lr": 0.00018155680102489234, "train_loss": 3.323198780619936, "epoch": 219, "n_parameters": 11590840}
{"train_lr": 0.00017765062666479713, "train_loss": 3.315860111698163, "test_loss": 0.978888113950861, "test_acc1": 77.4140024609375, "test_acc5": 93.85800254882813, "epoch": 220, "n_parameters": 11590840}
{"train_lr": 0.0001737803498523614, "train_loss": 3.320668717177747, "epoch": 221, "n_parameters": 11590840}
{"train_lr": 0.0001699463950070852, "train_loss": 3.3042723018702844, "epoch": 222, "n_parameters": 11590840}
{"train_lr": 0.00016614918256530037, "train_loss": 3.297677642006954, "epoch": 223, "n_parameters": 11590840}
{"train_lr": 0.0001623891289341519, "train_loss": 3.3023981789080836, "epoch": 224, "n_parameters": 11590840}
{"train_lr": 0.00015866664644587948, "train_loss": 3.2876113930003914, "epoch": 225, "n_parameters": 11590840}
{"train_lr": 0.00015498214331266302, "train_loss": 3.287639162987828, "epoch": 226, "n_parameters": 11590840}
{"train_lr": 0.00015133602358175863, "train_loss": 3.2908601643060513, "epoch": 227, "n_parameters": 11590840}
{"train_lr": 0.00014772868709131358, "train_loss": 3.281462552426435, "epoch": 228, "n_parameters": 11590840}
{"train_lr": 0.00014416052942639815, "train_loss": 3.285233736038208, "epoch": 229, "n_parameters": 11590840}
{"train_lr": 0.0001406319418757283, "train_loss": 3.26800053134906, "test_loss": 0.9623063921471665, "test_acc1": 77.96600250976563, "test_acc5": 94.17400252929687, "epoch": 230, "n_parameters": 11590840}
{"train_lr": 0.00013714331138869076, "train_loss": 3.2689359839037837, "epoch": 231, "n_parameters": 11590840}
{"train_lr": 0.00013369502053292505, "train_loss": 3.2619058941837125, "epoch": 232, "n_parameters": 11590840}
{"train_lr": 0.00013028744745238475, "train_loss": 3.255431350055549, "epoch": 233, "n_parameters": 11590840}
{"train_lr": 0.0001269209658258495, "train_loss": 3.2588716568373184, "epoch": 234, "n_parameters": 11590840}
{"train_lr": 0.0001235959448259827, "train_loss": 3.252390821012471, "epoch": 235, "n_parameters": 11590840}
{"train_lr": 0.00012031274907879949, "train_loss": 3.2490729571437, "epoch": 236, "n_parameters": 11590840}
{"train_lr": 0.00011707173862371172, "train_loss": 3.245385940757682, "epoch": 237, "n_parameters": 11590840}
{"train_lr": 0.00011387326887403272, "train_loss": 3.2360426683386834, "epoch": 238, "n_parameters": 11590840}
{"train_lr": 0.00011071769057802179, "train_loss": 3.234613025026451, "epoch": 239, "n_parameters": 11590840}
{"train_lr": 0.00010760534978039747, "train_loss": 3.2343959636348996, "test_loss": 0.9611213823844647, "test_acc1": 78.19800240234375, "test_acc5": 94.37800247070312, "epoch": 240, "n_parameters": 11590840}
{"train_lr": 0.00010453658778440302, "train_loss": 3.2259613303281514, "epoch": 241, "n_parameters": 11590840}
{"train_lr": 0.0001015117411143637, "train_loss": 3.2356295629108933, "epoch": 242, "n_parameters": 11590840}
{"train_lr": 9.853114147881454e-05, "train_loss": 3.231938961360285, "epoch": 243, "n_parameters": 11590840}
{"train_lr": 9.559511573409267e-05, "train_loss": 3.226304765347001, "epoch": 244, "n_parameters": 11590840}
{"train_lr": 9.270398584849976e-05, "train_loss": 3.2198906852592, "epoch": 245, "n_parameters": 11590840}
{"train_lr": 8.985806886701763e-05, "train_loss": 3.207187651885118, "epoch": 246, "n_parameters": 11590840}
{"train_lr": 8.705767687650155e-05, "train_loss": 3.212325418857838, "epoch": 247, "n_parameters": 11590840}
{"train_lr": 8.430311697149993e-05, "train_loss": 3.207398280441809, "epoch": 248, "n_parameters": 11590840}
{"train_lr": 8.159469122054745e-05, "train_loss": 3.20298746091237, "epoch": 249, "n_parameters": 11590840}
{"train_lr": 7.893269663304691e-05, "train_loss": 3.195464231282306, "test_loss": 0.9325173419554115, "test_acc1": 78.83600243652344, "test_acc5": 94.46600252929687, "epoch": 250, "n_parameters": 11590840}
{"train_lr": 7.631742512670381e-05, "train_loss": 3.201913250888661, "epoch": 251, "n_parameters": 11590840}
{"train_lr": 7.37491634955067e-05, "train_loss": 3.19635313899397, "epoch": 252, "n_parameters": 11590840}
{"train_lr": 7.122819337828824e-05, "train_loss": 3.1866823428040214, "epoch": 253, "n_parameters": 11590840}
{"train_lr": 6.875479122782832e-05, "train_loss": 3.180942725625446, "epoch": 254, "n_parameters": 11590840}
{"train_lr": 6.632922828055003e-05, "train_loss": 3.1880086358431146, "epoch": 255, "n_parameters": 11590840}
{"train_lr": 6.395177052675899e-05, "train_loss": 3.1830955958075755, "epoch": 256, "n_parameters": 11590840}
{"train_lr": 6.162267868149164e-05, "train_loss": 3.1728168588271624, "epoch": 257, "n_parameters": 11590840}
{"train_lr": 5.934220815591336e-05, "train_loss": 3.1835445344066926, "epoch": 258, "n_parameters": 11590840}
{"train_lr": 5.7110609029320316e-05, "train_loss": 3.174825050657411, "epoch": 259, "n_parameters": 11590840}
{"train_lr": 5.492812602170232e-05, "train_loss": 3.166238883964354, "test_loss": 0.939041365266303, "test_acc1": 78.96000243652344, "test_acc5": 94.67000270507812, "epoch": 260, "n_parameters": 11590840}
{"train_lr": 5.2794998466913834e-05, "train_loss": 3.167985329882418, "epoch": 261, "n_parameters": 11590840}
{"train_lr": 5.0711460286429e-05, "train_loss": 3.1589204837783256, "epoch": 262, "n_parameters": 11590840}
{"train_lr": 4.8677739963691566e-05, "train_loss": 3.1595042114444585, "epoch": 263, "n_parameters": 11590840}
{"train_lr": 4.669406051905346e-05, "train_loss": 3.16147943028062, "epoch": 264, "n_parameters": 11590840}
{"train_lr": 4.476063948531632e-05, "train_loss": 3.1560502262543335, "epoch": 265, "n_parameters": 11590840}
{"train_lr": 4.287768888388454e-05, "train_loss": 3.1515720459721166, "epoch": 266, "n_parameters": 11590840}
{"train_lr": 4.1045415201513915e-05, "train_loss": 3.1459253384864967, "epoch": 267, "n_parameters": 11590840}
{"train_lr": 3.926401936765786e-05, "train_loss": 3.147256157571654, "epoch": 268, "n_parameters": 11590840}
{"train_lr": 3.753369673244709e-05, "train_loss": 3.145818961026381, "epoch": 269, "n_parameters": 11590840}
{"train_lr": 3.585463704525412e-05, "train_loss": 3.142557013354046, "test_loss": 0.9168021891308927, "test_acc1": 79.23400267089843, "test_acc5": 94.70600247070313, "epoch": 270, "n_parameters": 11590840}
{"train_lr": 3.4227024433899046e-05, "train_loss": 3.1439421949483792, "epoch": 271, "n_parameters": 11590840}
{"train_lr": 3.2651037384443736e-05, "train_loss": 3.1346061845644297, "epoch": 272, "n_parameters": 11590840}
{"train_lr": 3.112684872162703e-05, "train_loss": 3.135633980567983, "epoch": 273, "n_parameters": 11590840}
{"train_lr": 2.9654625589913e-05, "train_loss": 3.1337282281461283, "epoch": 274, "n_parameters": 11590840}
{"train_lr": 2.8234529435159726e-05, "train_loss": 3.1363811073043077, "epoch": 275, "n_parameters": 11590840}
{"train_lr": 2.686671598691151e-05, "train_loss": 3.1295296204842917, "epoch": 276, "n_parameters": 11590840}
{"train_lr": 2.55513352413271e-05, "train_loss": 3.123510868655597, "epoch": 277, "n_parameters": 11590840}
{"train_lr": 2.4288531444729954e-05, "train_loss": 3.126117390849226, "epoch": 278, "n_parameters": 11590840}
{"train_lr": 2.3078443077785567e-05, "train_loss": 3.1179381135603985, "epoch": 279, "n_parameters": 11590840}
{"train_lr": 2.192120284031953e-05, "train_loss": 3.12479729363196, "test_loss": 0.9034121544653428, "test_acc1": 79.43600257324219, "test_acc5": 94.82000252929687, "epoch": 280, "n_parameters": 11590840}
{"train_lr": 2.0816937636766537e-05, "train_loss": 3.1183729855229054, "epoch": 281, "n_parameters": 11590840}
{"train_lr": 1.976576856224742e-05, "train_loss": 3.1258816430918412, "epoch": 282, "n_parameters": 11590840}
{"train_lr": 1.8767810889299472e-05, "train_loss": 3.1130714507387887, "epoch": 283, "n_parameters": 11590840}
{"train_lr": 1.7823174055225238e-05, "train_loss": 3.1153017404339582, "epoch": 284, "n_parameters": 11590840}
{"train_lr": 1.6931961650100128e-05, "train_loss": 3.1178496638647943, "epoch": 285, "n_parameters": 11590840}
{"train_lr": 1.609427140540658e-05, "train_loss": 3.1148206856396556, "epoch": 286, "n_parameters": 11590840}
{"train_lr": 1.5310195183320784e-05, "train_loss": 3.1152974600962504, "epoch": 287, "n_parameters": 11590840}
{"train_lr": 1.4579818966635116e-05, "train_loss": 3.109842925179872, "epoch": 288, "n_parameters": 11590840}
{"train_lr": 1.3903222849333426e-05, "train_loss": 3.114564793251401, "epoch": 289, "n_parameters": 11590840}
{"train_lr": 1.3280481027803718e-05, "train_loss": 3.1118165004453497, "test_loss": 0.8950902612958375, "test_acc1": 79.62200249023438, "test_acc5": 94.91200266601562, "epoch": 290, "n_parameters": 11590840}
{"train_lr": 1.2711661792704427e-05, "train_loss": 3.1051136378785498, "test_loss": 0.9018061433943753, "test_acc1": 79.5720026953125, "test_acc5": 94.87200235351563, "epoch": 291, "n_parameters": 11590840}
{"train_lr": 1.2196827521475628e-05, "train_loss": 3.109409333704759, "test_loss": 0.9035136366826821, "test_acc1": 79.66800276855469, "test_acc5": 94.89800244140625, "epoch": 292, "n_parameters": 11590840}
{"train_lr": 1.1736034671495227e-05, "train_loss": 3.114151931566586, "test_loss": 0.9021170568648883, "test_acc1": 79.62600262207032, "test_acc5": 94.86000256835938, "epoch": 293, "n_parameters": 11590840}
{"train_lr": 1.1329333773893123e-05, "train_loss": 3.106837773280178, "test_loss": 0.8939082184057127, "test_acc1": 79.76000228027344, "test_acc5": 94.88600251953125, "epoch": 294, "n_parameters": 11590840}
{"train_lr": 1.0976769428005425e-05, "train_loss": 3.1012234592275747, "test_loss": 0.8985905765459455, "test_acc1": 79.67400261230469, "test_acc5": 94.93800259765625, "epoch": 295, "n_parameters": 11590840}
{"train_lr": 1.067838029648576e-05, "train_loss": 3.0921816407776563, "test_loss": 0.9033976534088909, "test_acc1": 79.68000258789063, "test_acc5": 94.942002734375, "epoch": 296, "n_parameters": 11590840}
{"train_lr": 1.0434199101065238e-05, "train_loss": 3.0970070224752626, "test_loss": 0.9022497556447069, "test_acc1": 79.7100026171875, "test_acc5": 94.92600255859375, "epoch": 297, "n_parameters": 11590840}
{"train_lr": 1.0244252618963044e-05, "train_loss": 3.10096681386018, "test_loss": 0.901076162112627, "test_acc1": 79.72600258789062, "test_acc5": 94.94600258789062, "epoch": 298, "n_parameters": 11590840}
{"train_lr": 1.0108561679951307e-05, "train_loss": 3.10676134603439, "test_loss": 0.9043267702690942, "test_acc1": 79.58200251953124, "test_acc5": 94.92000267578125, "epoch": 299, "n_parameters": 11590840}
LMMMEng commented 1 year ago

Many thanks for your rapid reply! I can only see the logs starting from 152 epoch. Is there a complete log? (0-299epochs) Thanks in advance.

yuhuan-wu commented 1 year ago

Thank you. The complete log is as below:

{"train_lr": 9.999999999999953e-07, "train_loss": 6.91375186203195, "test_loss": 6.880830226053718, "test_acc1": 0.2800000106811523, "test_acc5": 1.2460000416564943, "epoch": 0, "n_parameters": 11590840}
{"train_lr": 9.999999999999953e-07, "train_loss": 6.898907404342334, "epoch": 1, "n_parameters": 11590840}
{"train_lr": 5.0950000000000534e-05, "train_loss": 6.834851624582597, "epoch": 2, "n_parameters": 11590840}
{"train_lr": 0.00010090000000000069, "train_loss": 6.687278224409913, "epoch": 3, "n_parameters": 11590840}
{"train_lr": 0.0001508500000000009, "train_loss": 6.482151659558431, "epoch": 4, "n_parameters": 11590840}
{"train_lr": 0.0002008000000000009, "train_loss": 6.28669337050425, "epoch": 5, "n_parameters": 11590840}
{"train_lr": 0.00025075000000000097, "train_loss": 6.097037869105808, "epoch": 6, "n_parameters": 11590840}
{"train_lr": 0.0003006999999999917, "train_loss": 5.919044420778227, "epoch": 7, "n_parameters": 11590840}
{"train_lr": 0.0003506500000000049, "train_loss": 5.747095362459727, "epoch": 8, "n_parameters": 11590840}
{"train_lr": 0.000400599999999987, "train_loss": 5.584584846866312, "epoch": 9, "n_parameters": 11590840}
{"train_lr": 0.00045055000000000106, "train_loss": 5.443999265309432, "test_loss": 3.1004530309720804, "test_acc1": 37.83200105957031, "test_acc5": 63.882002041015625, "epoch": 10, "n_parameters": 11590840}
{"train_lr": 0.0005005000000000065, "train_loss": 5.305884023555082, "epoch": 11, "n_parameters": 11590840}
{"train_lr": 0.0005504499999999883, "train_loss": 5.195875042109943, "epoch": 12, "n_parameters": 11590840}
{"train_lr": 0.0006003999999999824, "train_loss": 5.093353894021776, "epoch": 13, "n_parameters": 11590840}
{"train_lr": 0.0006503500000000137, "train_loss": 5.0086642658110145, "epoch": 14, "n_parameters": 11590840}
{"train_lr": 0.0007002999999999891, "train_loss": 4.9355733668585, "epoch": 15, "n_parameters": 11590840}
{"train_lr": 0.0007502499999999804, "train_loss": 4.861918290694364, "epoch": 16, "n_parameters": 11590840}
{"train_lr": 0.0008002000000000078, "train_loss": 4.80316432512445, "epoch": 17, "n_parameters": 11590840}
{"train_lr": 0.0008501499999999945, "train_loss": 4.759728804528475, "epoch": 18, "n_parameters": 11590840}
{"train_lr": 0.0009000999999999793, "train_loss": 4.71188492378552, "epoch": 19, "n_parameters": 11590840}
{"train_lr": 0.0009500499999999993, "train_loss": 4.670197305204771, "test_loss": 2.0542636723008774, "test_acc1": 55.7800014453125, "test_acc5": 80.37400276367187, "epoch": 20, "n_parameters": 11590840}
{"train_lr": 0.000989183062363242, "train_loss": 4.621066153621216, "epoch": 21, "n_parameters": 11590840}
{"train_lr": 0.0009880787971596754, "train_loss": 4.58046535803355, "epoch": 22, "n_parameters": 11590840}
{"train_lr": 0.0009869215569222388, "train_loss": 4.51851456194854, "epoch": 23, "n_parameters": 11590840}
{"train_lr": 0.0009857114685552669, "train_loss": 4.49787975472512, "epoch": 24, "n_parameters": 11590840}
{"train_lr": 0.0009844486647586856, "train_loss": 4.4268374759897435, "epoch": 25, "n_parameters": 11590840}
{"train_lr": 0.00098313328401311, "train_loss": 4.401531908771307, "epoch": 26, "n_parameters": 11590840}
{"train_lr": 0.000981765470564834, "train_loss": 4.372993291520196, "epoch": 27, "n_parameters": 11590840}
{"train_lr": 0.0009803453744100898, "train_loss": 4.355509050791975, "epoch": 28, "n_parameters": 11590840}
{"train_lr": 0.0009788731512783825, "train_loss": 4.319360704778386, "epoch": 29, "n_parameters": 11590840}
{"train_lr": 0.00097734896261555, "train_loss": 4.284851019569247, "test_loss": 1.6648690004385154, "test_acc1": 63.20800177734375, "test_acc5": 85.79800240234376, "epoch": 30, "n_parameters": 11590840}
{"train_lr": 0.0009757729755660997, "train_loss": 4.266313350029129, "epoch": 31, "n_parameters": 11590840}
{"train_lr": 0.000974145362954742, "train_loss": 4.240361532766661, "epoch": 32, "n_parameters": 11590840}
{"train_lr": 0.000972466303267533, "train_loss": 4.224560717789294, "epoch": 33, "n_parameters": 11590840}
{"train_lr": 0.000970735980632352, "train_loss": 4.190813951688609, "epoch": 34, "n_parameters": 11590840}
{"train_lr": 0.0009689545847984843, "train_loss": 4.182979767509311, "epoch": 35, "n_parameters": 11590840}
{"train_lr": 0.000967122311116087, "train_loss": 4.172324121951295, "epoch": 36, "n_parameters": 11590840}
{"train_lr": 0.000965239360514694, "train_loss": 4.148369332059301, "epoch": 37, "n_parameters": 11590840}
{"train_lr": 0.0009633059394809262, "train_loss": 4.131543393901212, "epoch": 38, "n_parameters": 11590840}
{"train_lr": 0.0009613222600362934, "train_loss": 4.131294625077984, "epoch": 39, "n_parameters": 11590840}
{"train_lr": 0.0009592885397135909, "train_loss": 4.111912642308562, "test_loss": 1.513088011104642, "test_acc1": 65.79600184570313, "test_acc5": 87.68200236328126, "epoch": 40, "n_parameters": 11590840}
{"train_lr": 0.000957205001533091, "train_loss": 4.097033336913462, "epoch": 41, "n_parameters": 11590840}
{"train_lr": 0.0009550718739782929, "train_loss": 4.077440848024629, "epoch": 42, "n_parameters": 11590840}
{"train_lr": 0.0009528893909706948, "train_loss": 4.079473964697261, "epoch": 43, "n_parameters": 11590840}
{"train_lr": 0.0009506577918441058, "train_loss": 4.0680296417715835, "epoch": 44, "n_parameters": 11590840}
{"train_lr": 0.0009483773213185082, "train_loss": 4.054321662461062, "epoch": 45, "n_parameters": 11590840}
{"train_lr": 0.0009460482294732177, "train_loss": 4.045842117495198, "epoch": 46, "n_parameters": 11590840}
{"train_lr": 0.0009436707717194363, "train_loss": 4.036810118993886, "epoch": 47, "n_parameters": 11590840}
{"train_lr": 0.0009412452087721683, "train_loss": 4.0280411925723705, "epoch": 48, "n_parameters": 11590840}
{"train_lr": 0.0009387718066217019, "train_loss": 4.015489292325829, "epoch": 49, "n_parameters": 11590840}
{"train_lr": 0.0009362508365045029, "train_loss": 4.004816210765442, "test_loss": 1.4747849996308333, "test_acc1": 67.596001953125, "test_acc5": 88.50000263671875, "epoch": 50, "n_parameters": 11590840}
{"train_lr": 0.0009336825748732897, "train_loss": 4.004246118876765, "epoch": 51, "n_parameters": 11590840}
{"train_lr": 0.0009310673033669664, "train_loss": 3.99529940537888, "epoch": 52, "n_parameters": 11590840}
{"train_lr": 0.0009284053087794628, "train_loss": 3.9909315702917096, "epoch": 53, "n_parameters": 11590840}
{"train_lr": 0.0009256968830284788, "train_loss": 3.973643736873599, "epoch": 54, "n_parameters": 11590840}
{"train_lr": 0.0009229423231234937, "train_loss": 3.9764581869165005, "epoch": 55, "n_parameters": 11590840}
{"train_lr": 0.0009201419311329745, "train_loss": 3.9516206474708233, "epoch": 56, "n_parameters": 11590840}
{"train_lr": 0.000917296014151488, "train_loss": 3.9675430746482525, "epoch": 57, "n_parameters": 11590840}
{"train_lr": 0.0009144048842658995, "train_loss": 3.9319417109782937, "epoch": 58, "n_parameters": 11590840}
{"train_lr": 0.0009114688585212051, "train_loss": 3.9380983806056657, "epoch": 59, "n_parameters": 11590840}
{"train_lr": 0.0009084882588856559, "train_loss": 3.9370276292355704, "test_loss": 1.3946601718891667, "test_acc1": 68.48400201171874, "test_acc5": 89.23400232421875, "epoch": 60, "n_parameters": 11590840}
{"train_lr": 0.0009054634122156225, "train_loss": 3.9413685554223097, "epoch": 61, "n_parameters": 11590840}
{"train_lr": 0.0009023946502195919, "train_loss": 3.92633501226477, "epoch": 62, "n_parameters": 11590840}
{"train_lr": 0.0008992823094219693, "train_loss": 3.9151942359743646, "epoch": 63, "n_parameters": 11590840}
{"train_lr": 0.0008961267311259657, "train_loss": 3.9104802890075483, "epoch": 64, "n_parameters": 11590840}
{"train_lr": 0.0008929282613763056, "train_loss": 3.9074236472352424, "epoch": 65, "n_parameters": 11590840}
{"train_lr": 0.0008896872509212006, "train_loss": 3.892708344282292, "epoch": 66, "n_parameters": 11590840}
{"train_lr": 0.0008864040551740225, "train_loss": 3.9074251199130723, "epoch": 67, "n_parameters": 11590840}
{"train_lr": 0.0008830790341741555, "train_loss": 3.8962820676400316, "epoch": 68, "n_parameters": 11590840}
{"train_lr": 0.000879712552547612, "train_loss": 3.887681174573662, "epoch": 69, "n_parameters": 11590840}
{"train_lr": 0.0008763049794670927, "train_loss": 3.8794618038345967, "test_loss": 1.3157839142639218, "test_acc1": 69.5720022265625, "test_acc5": 89.99800265625, "epoch": 70, "n_parameters": 11590840}
{"train_lr": 0.0008728566886112912, "train_loss": 3.8778769511113063, "epoch": 71, "n_parameters": 11590840}
{"train_lr": 0.000869368058124286, "train_loss": 3.8744343915622204, "epoch": 72, "n_parameters": 11590840}
{"train_lr": 0.00086583947057361, "train_loss": 3.8596577396114573, "epoch": 73, "n_parameters": 11590840}
{"train_lr": 0.0008622713129087039, "train_loss": 3.8765251202930173, "epoch": 74, "n_parameters": 11590840}
{"train_lr": 0.0008586639764182332, "train_loss": 3.8560798949570203, "epoch": 75, "n_parameters": 11590840}
{"train_lr": 0.0008550178566873157, "train_loss": 3.855645470053172, "epoch": 76, "n_parameters": 11590840}
{"train_lr": 0.0008513333535541271, "train_loss": 3.863833758899634, "epoch": 77, "n_parameters": 11590840}
{"train_lr": 0.0008476108710658584, "train_loss": 3.8357053354299135, "epoch": 78, "n_parameters": 11590840}
{"train_lr": 0.0008438508174346799, "train_loss": 3.834356178768533, "epoch": 79, "n_parameters": 11590840}
{"train_lr": 0.0008400536049929256, "train_loss": 3.8291077033983623, "test_loss": 1.3114848655598763, "test_acc1": 70.29400217773437, "test_acc5": 90.2040024609375, "epoch": 80, "n_parameters": 11590840}
{"train_lr": 0.0008362196501476587, "train_loss": 3.833939125974306, "epoch": 81, "n_parameters": 11590840}
{"train_lr": 0.0008323493733352106, "train_loss": 3.817346933886683, "epoch": 82, "n_parameters": 11590840}
{"train_lr": 0.0008284431989751198, "train_loss": 3.823519348431167, "epoch": 83, "n_parameters": 11590840}
{"train_lr": 0.0008245015554233518, "train_loss": 3.822793663643914, "epoch": 84, "n_parameters": 11590840}
{"train_lr": 0.0008205248749255873, "train_loss": 3.8193715292868093, "epoch": 85, "n_parameters": 11590840}
{"train_lr": 0.0008165135935696843, "train_loss": 3.8143159262091517, "epoch": 86, "n_parameters": 11590840}
{"train_lr": 0.0008124681512377846, "train_loss": 3.8097015201902504, "epoch": 87, "n_parameters": 11590840}
{"train_lr": 0.0008083889915582182, "train_loss": 3.8096268920780276, "epoch": 88, "n_parameters": 11590840}
{"train_lr": 0.0008042765618568846, "train_loss": 3.7988074448564166, "epoch": 89, "n_parameters": 11590840}
{"train_lr": 0.0008001313131079711, "train_loss": 3.79829366453927, "test_loss": 1.26552754845328, "test_acc1": 70.62800227539063, "test_acc5": 90.54000248046874, "epoch": 90, "n_parameters": 11590840}
{"train_lr": 0.0007959536998847494, "train_loss": 3.7946356395832734, "epoch": 91, "n_parameters": 11590840}
{"train_lr": 0.00079174418030961, "train_loss": 3.80087052675174, "epoch": 92, "n_parameters": 11590840}
{"train_lr": 0.0007875032160038206, "train_loss": 3.7888032052156735, "epoch": 93, "n_parameters": 11590840}
{"train_lr": 0.0007832312720368116, "train_loss": 3.7822648813779787, "epoch": 94, "n_parameters": 11590840}
{"train_lr": 0.0007789288168754665, "train_loss": 3.7664834796953546, "epoch": 95, "n_parameters": 11590840}
{"train_lr": 0.0007745963223324493, "train_loss": 3.7706048905992393, "epoch": 96, "n_parameters": 11590840}
{"train_lr": 0.0007702342635146133, "train_loss": 3.7520810233222113, "epoch": 97, "n_parameters": 11590840}
{"train_lr": 0.0007658431187708383, "train_loss": 3.7598581806742413, "epoch": 98, "n_parameters": 11590840}
{"train_lr": 0.0007614233696396924, "train_loss": 3.748048622735875, "epoch": 99, "n_parameters": 11590840}
{"train_lr": 0.000756975500796447, "train_loss": 3.762699702232004, "test_loss": 1.2603078159212155, "test_acc1": 71.27800225585938, "test_acc5": 90.97800259765626, "epoch": 100, "n_parameters": 11590840}
{"train_lr": 0.000752500000000017, "train_loss": 3.7533907828380544, "epoch": 101, "n_parameters": 11590840}
{"train_lr": 0.0007479973580395145, "train_loss": 3.7439935236430757, "epoch": 102, "n_parameters": 11590840}
{"train_lr": 0.0007434680686803327, "train_loss": 3.75046694831406, "epoch": 103, "n_parameters": 11590840}
{"train_lr": 0.0007389126286101685, "train_loss": 3.7478795570435284, "epoch": 104, "n_parameters": 11590840}
{"train_lr": 0.0007343315373843358, "train_loss": 3.7336331602099606, "epoch": 105, "n_parameters": 11590840}
{"train_lr": 0.0007297252973710597, "train_loss": 3.732796702501204, "epoch": 106, "n_parameters": 11590840}
{"train_lr": 0.0007250944136965276, "train_loss": 3.724098074922173, "epoch": 107, "n_parameters": 11590840}
{"train_lr": 0.000720439394189308, "train_loss": 3.7295565173494443, "epoch": 108, "n_parameters": 11590840}
{"train_lr": 0.0007157607493246967, "train_loss": 3.736929681518381, "epoch": 109, "n_parameters": 11590840}
{"train_lr": 0.0007110589921689151, "train_loss": 3.719291471177154, "test_loss": 1.2405693537861335, "test_acc1": 71.76800212890625, "test_acc5": 91.224002734375, "epoch": 110, "n_parameters": 11590840}
{"train_lr": 0.0007063346383225066, "train_loss": 3.724559374516912, "epoch": 111, "n_parameters": 11590840}
{"train_lr": 0.0007015882058641977, "train_loss": 3.7266455433732695, "epoch": 112, "n_parameters": 11590840}
{"train_lr": 0.0006968202152938008, "train_loss": 3.704893908388228, "epoch": 113, "n_parameters": 11590840}
{"train_lr": 0.0006920311894751397, "train_loss": 3.708802143685061, "epoch": 114, "n_parameters": 11590840}
{"train_lr": 0.0006872216535789268, "train_loss": 3.7020419045603816, "epoch": 115, "n_parameters": 11590840}
{"train_lr": 0.0006823921350249367, "train_loss": 3.6846609861730673, "epoch": 116, "n_parameters": 11590840}
{"train_lr": 0.0006775431634242845, "train_loss": 3.6996953993392507, "epoch": 117, "n_parameters": 11590840}
{"train_lr": 0.0006726752705214027, "train_loss": 3.7110450157253956, "epoch": 118, "n_parameters": 11590840}
{"train_lr": 0.0006677889901356114, "train_loss": 3.698194497447315, "epoch": 119, "n_parameters": 11590840}
{"train_lr": 0.000662884858102535, "train_loss": 3.687903973124296, "test_loss": 1.2118800216503727, "test_acc1": 72.6640021484375, "test_acc5": 91.57800248046875, "epoch": 120, "n_parameters": 11590840}
{"train_lr": 0.0006579634122155855, "train_loss": 3.6782316001389717, "epoch": 121, "n_parameters": 11590840}
{"train_lr": 0.0006530251921667783, "train_loss": 3.6809611139918785, "epoch": 122, "n_parameters": 11590840}
{"train_lr": 0.0006480707394875039, "train_loss": 3.6690469205522422, "epoch": 123, "n_parameters": 11590840}
{"train_lr": 0.0006431005974894059, "train_loss": 3.6738014769115797, "epoch": 124, "n_parameters": 11590840}
{"train_lr": 0.000638115311204536, "train_loss": 3.6529184020489907, "epoch": 125, "n_parameters": 11590840}
{"train_lr": 0.0006331154273257683, "train_loss": 3.663861138214596, "epoch": 126, "n_parameters": 11590840}
{"train_lr": 0.0006281014941466044, "train_loss": 3.650975069077276, "epoch": 127, "n_parameters": 11590840}
{"train_lr": 0.0006230740615014092, "train_loss": 3.655613499198505, "epoch": 128, "n_parameters": 11590840}
{"train_lr": 0.0006180336807047859, "train_loss": 3.6478796570802285, "epoch": 129, "n_parameters": 11590840}
{"train_lr": 0.0006129809044912789, "train_loss": 3.6521722498081095, "test_loss": 1.1903528264005676, "test_acc1": 72.81800240234375, "test_acc5": 91.8220021484375, "epoch": 130, "n_parameters": 11590840}
{"train_lr": 0.0006079162869547816, "train_loss": 3.6466257963344444, "epoch": 131, "n_parameters": 11590840}
{"train_lr": 0.0006028403834876773, "train_loss": 3.651870818828031, "epoch": 132, "n_parameters": 11590840}
{"train_lr": 0.0005977537507199184, "train_loss": 3.639648040326284, "epoch": 133, "n_parameters": 11590840}
{"train_lr": 0.0005926569464581886, "train_loss": 3.639648497581101, "epoch": 134, "n_parameters": 11590840}
{"train_lr": 0.0005875505296244566, "train_loss": 3.649845886859391, "epoch": 135, "n_parameters": 11590840}
{"train_lr": 0.0005824350601949218, "train_loss": 3.633724526202174, "epoch": 136, "n_parameters": 11590840}
{"train_lr": 0.0005773110991383895, "train_loss": 3.625698553620101, "epoch": 137, "n_parameters": 11590840}
{"train_lr": 0.0005721792083549796, "train_loss": 3.6235382628860138, "epoch": 138, "n_parameters": 11590840}
{"train_lr": 0.0005670399506143468, "train_loss": 3.608216544659399, "epoch": 139, "n_parameters": 11590840}
{"train_lr": 0.0005618938894939768, "train_loss": 3.6187517349001506, "test_loss": 1.1677357581735568, "test_acc1": 73.75200250976563, "test_acc5": 92.30000248046875, "epoch": 140, "n_parameters": 11590840}
{"train_lr": 0.0005567415893175015, "train_loss": 3.621376308034078, "epoch": 141, "n_parameters": 11590840}
{"train_lr": 0.0005515836150926655, "train_loss": 3.595934270144843, "epoch": 142, "n_parameters": 11590840}
{"train_lr": 0.0005464205324494959, "train_loss": 3.6132679593077093, "epoch": 143, "n_parameters": 11590840}
{"train_lr": 0.0005412529075781865, "train_loss": 3.617913811422176, "epoch": 144, "n_parameters": 11590840}
{"train_lr": 0.0005360813071670112, "train_loss": 3.6029662199729353, "epoch": 145, "n_parameters": 11590840}
{"train_lr": 0.0005309062983402611, "train_loss": 3.602208800500722, "epoch": 146, "n_parameters": 11590840}
{"train_lr": 0.0005257284485959572, "train_loss": 3.5884025637194408, "epoch": 147, "n_parameters": 11590840}
{"train_lr": 0.000520548325743666, "train_loss": 3.5758108694871646, "epoch": 148, "n_parameters": 11590840}
{"train_lr": 0.0005153664978422618, "train_loss": 3.5916598458274853, "epoch": 149, "n_parameters": 11590840}
{"train_lr": 0.0005101835331375457, "train_loss": 3.592105409271902, "test_loss": 1.110040875336596, "test_acc1": 74.15000254882813, "test_acc5": 92.4580026171875, "epoch": 150, "n_parameters": 11590840}
{"train_lr": 0.0005049999999999881, "train_loss": 3.5801324951086495, "epoch": 151, "n_parameters": 11590840}
{"train_lr": 0.0004998164668624689, "train_loss": 3.5743932656104045, "epoch": 152, "n_parameters": 11590840}
{"train_lr": 0.0004946335021577291, "train_loss": 3.5612054719722908, "epoch": 153, "n_parameters": 11590840}
{"train_lr": 0.0004894516742563275, "train_loss": 3.56016564652693, "epoch": 154, "n_parameters": 11590840}
{"train_lr": 0.0004842715514040588, "train_loss": 3.5652903384275194, "epoch": 155, "n_parameters": 11590840}
{"train_lr": 0.00047909370165973703, "train_loss": 3.5613126419097494, "epoch": 156, "n_parameters": 11590840}
{"train_lr": 0.00047391869283298395, "train_loss": 3.555761627132277, "epoch": 157, "n_parameters": 11590840}
{"train_lr": 0.0004687470924218306, "train_loss": 3.554628330883648, "epoch": 158, "n_parameters": 11590840}
{"train_lr": 0.0004635794675504931, "train_loss": 3.535118740906628, "epoch": 159, "n_parameters": 11590840}
{"train_lr": 0.0004584163849073357, "train_loss": 3.5404944376265117, "test_loss": 1.1421642118486866, "test_acc1": 74.286002109375, "test_acc5": 92.60000264648437, "epoch": 160, "n_parameters": 11590840}
{"train_lr": 0.0004532584106825094, "train_loss": 3.553695835226731, "epoch": 161, "n_parameters": 11590840}
{"train_lr": 0.0004481061105060409, "train_loss": 3.54094214683814, "epoch": 162, "n_parameters": 11590840}
{"train_lr": 0.00044296004938566933, "train_loss": 3.523124451176535, "epoch": 163, "n_parameters": 11590840}
{"train_lr": 0.0004378207916450146, "train_loss": 3.5339377294484375, "epoch": 164, "n_parameters": 11590840}
{"train_lr": 0.00043268890086160794, "train_loss": 3.515914784942409, "epoch": 165, "n_parameters": 11590840}
{"train_lr": 0.00042756493980507503, "train_loss": 3.517750824431626, "epoch": 166, "n_parameters": 11590840}
{"train_lr": 0.0004224494703755201, "train_loss": 3.5138288475126385, "epoch": 167, "n_parameters": 11590840}
{"train_lr": 0.00041734305354179923, "train_loss": 3.5127825907570758, "epoch": 168, "n_parameters": 11590840}
{"train_lr": 0.0004122462492800569, "train_loss": 3.5142803016207296, "epoch": 169, "n_parameters": 11590840}
{"train_lr": 0.0004071596165123325, "train_loss": 3.5071192180557693, "test_loss": 1.1510255648486916, "test_acc1": 74.8600022265625, "test_acc5": 92.81800235351562, "epoch": 170, "n_parameters": 11590840}
{"train_lr": 0.0004020837130452182, "train_loss": 3.5065351560247318, "epoch": 171, "n_parameters": 11590840}
{"train_lr": 0.00039701909550871056, "train_loss": 3.4980822681642167, "epoch": 172, "n_parameters": 11590840}
{"train_lr": 0.0003919663192952229, "train_loss": 3.4923600166035498, "epoch": 173, "n_parameters": 11590840}
{"train_lr": 0.00038692593849859734, "train_loss": 3.48667043068712, "epoch": 174, "n_parameters": 11590840}
{"train_lr": 0.0003818985058534034, "train_loss": 3.4858136311423578, "epoch": 175, "n_parameters": 11590840}
{"train_lr": 0.0003768845726742596, "train_loss": 3.4909114401808363, "epoch": 176, "n_parameters": 11590840}
{"train_lr": 0.0003718846887954506, "train_loss": 3.480566321898231, "epoch": 177, "n_parameters": 11590840}
{"train_lr": 0.00036689940251057154, "train_loss": 3.4779263569487275, "epoch": 178, "n_parameters": 11590840}
{"train_lr": 0.0003619292605124837, "train_loss": 3.471419211474063, "epoch": 179, "n_parameters": 11590840}
{"train_lr": 0.0003569748078332423, "train_loss": 3.471359407158493, "test_loss": 1.0835051923647694, "test_acc1": 75.71200217773438, "test_acc5": 93.12200203125, "epoch": 180, "n_parameters": 11590840}
{"train_lr": 0.00035203658778439235, "train_loss": 3.46441358041992, "epoch": 181, "n_parameters": 11590840}
{"train_lr": 0.0003471151418974503, "train_loss": 3.455791411687621, "epoch": 182, "n_parameters": 11590840}
{"train_lr": 0.0003422110098644085, "train_loss": 3.4613385772009453, "epoch": 183, "n_parameters": 11590840}
{"train_lr": 0.0003373247294785742, "train_loss": 3.4543474361860302, "epoch": 184, "n_parameters": 11590840}
{"train_lr": 0.0003324568365756944, "train_loss": 3.4412438851847447, "epoch": 185, "n_parameters": 11590840}
{"train_lr": 0.00032760786497508304, "train_loss": 3.4446582230542013, "epoch": 186, "n_parameters": 11590840}
{"train_lr": 0.00032277834642108216, "train_loss": 3.447267726123285, "epoch": 187, "n_parameters": 11590840}
{"train_lr": 0.00031796881052486697, "train_loss": 3.4390437261854334, "epoch": 188, "n_parameters": 11590840}
{"train_lr": 0.0003131797847062025, "train_loss": 3.437155818207849, "epoch": 189, "n_parameters": 11590840}
{"train_lr": 0.0003084117941357799, "train_loss": 3.436713810316283, "test_loss": 1.0393913918071322, "test_acc1": 75.97800221679688, "test_acc5": 93.26800233398437, "epoch": 190, "n_parameters": 11590840}
{"train_lr": 0.00030366536167747524, "train_loss": 3.423190413797788, "epoch": 191, "n_parameters": 11590840}
{"train_lr": 0.00029894100783110664, "train_loss": 3.422523183966998, "epoch": 192, "n_parameters": 11590840}
{"train_lr": 0.0002942392506752879, "train_loss": 3.4283898516977245, "epoch": 193, "n_parameters": 11590840}
{"train_lr": 0.0002895606058107011, "train_loss": 3.4096099771708133, "epoch": 194, "n_parameters": 11590840}
{"train_lr": 0.0002849055863034561, "train_loss": 3.4102377335873726, "epoch": 195, "n_parameters": 11590840}
{"train_lr": 0.0002802747026289244, "train_loss": 3.401014165412322, "epoch": 196, "n_parameters": 11590840}
{"train_lr": 0.00027566846261567343, "train_loss": 3.401992839994095, "epoch": 197, "n_parameters": 11590840}
{"train_lr": 0.00027108737138981483, "train_loss": 3.4068873604591325, "epoch": 198, "n_parameters": 11590840}
{"train_lr": 0.00026653193131964784, "train_loss": 3.3921055701567973, "epoch": 199, "n_parameters": 11590840}
{"train_lr": 0.00026200264196050673, "train_loss": 3.3936248188682026, "test_loss": 1.0303690580801033, "test_acc1": 76.59400234863281, "test_acc5": 93.540002421875, "epoch": 200, "n_parameters": 11590840}
{"train_lr": 0.00025750000000000143, "train_loss": 3.389856409016464, "epoch": 201, "n_parameters": 11590840}
{"train_lr": 0.0002530244992035622, "train_loss": 3.3846808571895535, "epoch": 202, "n_parameters": 11590840}
{"train_lr": 0.00024857663036030185, "train_loss": 3.372419533814839, "epoch": 203, "n_parameters": 11590840}
{"train_lr": 0.00024415688122914167, "train_loss": 3.3688626083776914, "epoch": 204, "n_parameters": 11590840}
{"train_lr": 0.00023976573648539732, "train_loss": 3.362280931832979, "epoch": 205, "n_parameters": 11590840}
{"train_lr": 0.0002354036776675575, "train_loss": 3.364592643116685, "epoch": 206, "n_parameters": 11590840}
{"train_lr": 0.00023107118312454297, "train_loss": 3.3697193838137802, "epoch": 207, "n_parameters": 11590840}
{"train_lr": 0.00022676872796319747, "train_loss": 3.372238050464341, "epoch": 208, "n_parameters": 11590840}
{"train_lr": 0.00022249678399621184, "train_loss": 3.3461100655398686, "epoch": 209, "n_parameters": 11590840}
{"train_lr": 0.00021825581969037202, "train_loss": 3.3540834311148724, "test_loss": 1.0081777131877183, "test_acc1": 77.00800224121093, "test_acc5": 93.76800241210937, "epoch": 210, "n_parameters": 11590840}
{"train_lr": 0.0002140463001152288, "train_loss": 3.3574369996667004, "epoch": 211, "n_parameters": 11590840}
{"train_lr": 0.00020986868689201908, "train_loss": 3.3426170350312234, "epoch": 212, "n_parameters": 11590840}
{"train_lr": 0.00020572343814312388, "train_loss": 3.3400714560259255, "epoch": 213, "n_parameters": 11590840}
{"train_lr": 0.00020161100844177222, "train_loss": 3.3358767846291015, "epoch": 214, "n_parameters": 11590840}
{"train_lr": 0.0001975318487622333, "train_loss": 3.3342676946489838, "epoch": 215, "n_parameters": 11590840}
{"train_lr": 0.0001934864064303279, "train_loss": 3.3346634693830897, "epoch": 216, "n_parameters": 11590840}
{"train_lr": 0.00018947512507439562, "train_loss": 3.3282264391485925, "epoch": 217, "n_parameters": 11590840}
{"train_lr": 0.00018549844457663907, "train_loss": 3.3229453079372666, "epoch": 218, "n_parameters": 11590840}
{"train_lr": 0.00018155680102489234, "train_loss": 3.323198780619936, "epoch": 219, "n_parameters": 11590840}
{"train_lr": 0.00017765062666479713, "train_loss": 3.315860111698163, "test_loss": 0.978888113950861, "test_acc1": 77.4140024609375, "test_acc5": 93.85800254882813, "epoch": 220, "n_parameters": 11590840}
{"train_lr": 0.0001737803498523614, "train_loss": 3.320668717177747, "epoch": 221, "n_parameters": 11590840}
{"train_lr": 0.0001699463950070852, "train_loss": 3.3042723018702844, "epoch": 222, "n_parameters": 11590840}
{"train_lr": 0.00016614918256530037, "train_loss": 3.297677642006954, "epoch": 223, "n_parameters": 11590840}
{"train_lr": 0.0001623891289341519, "train_loss": 3.3023981789080836, "epoch": 224, "n_parameters": 11590840}
{"train_lr": 0.00015866664644587948, "train_loss": 3.2876113930003914, "epoch": 225, "n_parameters": 11590840}
{"train_lr": 0.00015498214331266302, "train_loss": 3.287639162987828, "epoch": 226, "n_parameters": 11590840}
{"train_lr": 0.00015133602358175863, "train_loss": 3.2908601643060513, "epoch": 227, "n_parameters": 11590840}
{"train_lr": 0.00014772868709131358, "train_loss": 3.281462552426435, "epoch": 228, "n_parameters": 11590840}
{"train_lr": 0.00014416052942639815, "train_loss": 3.285233736038208, "epoch": 229, "n_parameters": 11590840}
{"train_lr": 0.0001406319418757283, "train_loss": 3.26800053134906, "test_loss": 0.9623063921471665, "test_acc1": 77.96600250976563, "test_acc5": 94.17400252929687, "epoch": 230, "n_parameters": 11590840}
{"train_lr": 0.00013714331138869076, "train_loss": 3.2689359839037837, "epoch": 231, "n_parameters": 11590840}
{"train_lr": 0.00013369502053292505, "train_loss": 3.2619058941837125, "epoch": 232, "n_parameters": 11590840}
{"train_lr": 0.00013028744745238475, "train_loss": 3.255431350055549, "epoch": 233, "n_parameters": 11590840}
{"train_lr": 0.0001269209658258495, "train_loss": 3.2588716568373184, "epoch": 234, "n_parameters": 11590840}
{"train_lr": 0.0001235959448259827, "train_loss": 3.252390821012471, "epoch": 235, "n_parameters": 11590840}
{"train_lr": 0.00012031274907879949, "train_loss": 3.2490729571437, "epoch": 236, "n_parameters": 11590840}
{"train_lr": 0.00011707173862371172, "train_loss": 3.245385940757682, "epoch": 237, "n_parameters": 11590840}
{"train_lr": 0.00011387326887403272, "train_loss": 3.2360426683386834, "epoch": 238, "n_parameters": 11590840}
{"train_lr": 0.00011071769057802179, "train_loss": 3.234613025026451, "epoch": 239, "n_parameters": 11590840}
{"train_lr": 0.00010760534978039747, "train_loss": 3.2343959636348996, "test_loss": 0.9611213823844647, "test_acc1": 78.19800240234375, "test_acc5": 94.37800247070312, "epoch": 240, "n_parameters": 11590840}
{"train_lr": 0.00010453658778440302, "train_loss": 3.2259613303281514, "epoch": 241, "n_parameters": 11590840}
{"train_lr": 0.0001015117411143637, "train_loss": 3.2356295629108933, "epoch": 242, "n_parameters": 11590840}
{"train_lr": 9.853114147881454e-05, "train_loss": 3.231938961360285, "epoch": 243, "n_parameters": 11590840}
{"train_lr": 9.559511573409267e-05, "train_loss": 3.226304765347001, "epoch": 244, "n_parameters": 11590840}
{"train_lr": 9.270398584849976e-05, "train_loss": 3.2198906852592, "epoch": 245, "n_parameters": 11590840}
{"train_lr": 8.985806886701763e-05, "train_loss": 3.207187651885118, "epoch": 246, "n_parameters": 11590840}
{"train_lr": 8.705767687650155e-05, "train_loss": 3.212325418857838, "epoch": 247, "n_parameters": 11590840}
{"train_lr": 8.430311697149993e-05, "train_loss": 3.207398280441809, "epoch": 248, "n_parameters": 11590840}
{"train_lr": 8.159469122054745e-05, "train_loss": 3.20298746091237, "epoch": 249, "n_parameters": 11590840}
{"train_lr": 7.893269663304691e-05, "train_loss": 3.195464231282306, "test_loss": 0.9325173419554115, "test_acc1": 78.83600243652344, "test_acc5": 94.46600252929687, "epoch": 250, "n_parameters": 11590840}
{"train_lr": 7.631742512670381e-05, "train_loss": 3.201913250888661, "epoch": 251, "n_parameters": 11590840}
{"train_lr": 7.37491634955067e-05, "train_loss": 3.19635313899397, "epoch": 252, "n_parameters": 11590840}
{"train_lr": 7.122819337828824e-05, "train_loss": 3.1866823428040214, "epoch": 253, "n_parameters": 11590840}
{"train_lr": 6.875479122782832e-05, "train_loss": 3.180942725625446, "epoch": 254, "n_parameters": 11590840}
{"train_lr": 6.632922828055003e-05, "train_loss": 3.1880086358431146, "epoch": 255, "n_parameters": 11590840}
{"train_lr": 6.395177052675899e-05, "train_loss": 3.1830955958075755, "epoch": 256, "n_parameters": 11590840}
{"train_lr": 6.162267868149164e-05, "train_loss": 3.1728168588271624, "epoch": 257, "n_parameters": 11590840}
{"train_lr": 5.934220815591336e-05, "train_loss": 3.1835445344066926, "epoch": 258, "n_parameters": 11590840}
{"train_lr": 5.7110609029320316e-05, "train_loss": 3.174825050657411, "epoch": 259, "n_parameters": 11590840}
{"train_lr": 5.492812602170232e-05, "train_loss": 3.166238883964354, "test_loss": 0.939041365266303, "test_acc1": 78.96000243652344, "test_acc5": 94.67000270507812, "epoch": 260, "n_parameters": 11590840}
{"train_lr": 5.2794998466913834e-05, "train_loss": 3.167985329882418, "epoch": 261, "n_parameters": 11590840}
{"train_lr": 5.0711460286429e-05, "train_loss": 3.1589204837783256, "epoch": 262, "n_parameters": 11590840}
{"train_lr": 4.8677739963691566e-05, "train_loss": 3.1595042114444585, "epoch": 263, "n_parameters": 11590840}
{"train_lr": 4.669406051905346e-05, "train_loss": 3.16147943028062, "epoch": 264, "n_parameters": 11590840}
{"train_lr": 4.476063948531632e-05, "train_loss": 3.1560502262543335, "epoch": 265, "n_parameters": 11590840}
{"train_lr": 4.287768888388454e-05, "train_loss": 3.1515720459721166, "epoch": 266, "n_parameters": 11590840}
{"train_lr": 4.1045415201513915e-05, "train_loss": 3.1459253384864967, "epoch": 267, "n_parameters": 11590840}
{"train_lr": 3.926401936765786e-05, "train_loss": 3.147256157571654, "epoch": 268, "n_parameters": 11590840}
{"train_lr": 3.753369673244709e-05, "train_loss": 3.145818961026381, "epoch": 269, "n_parameters": 11590840}
{"train_lr": 3.585463704525412e-05, "train_loss": 3.142557013354046, "test_loss": 0.9168021891308927, "test_acc1": 79.23400267089843, "test_acc5": 94.70600247070313, "epoch": 270, "n_parameters": 11590840}
{"train_lr": 3.4227024433899046e-05, "train_loss": 3.1439421949483792, "epoch": 271, "n_parameters": 11590840}
{"train_lr": 3.2651037384443736e-05, "train_loss": 3.1346061845644297, "epoch": 272, "n_parameters": 11590840}
{"train_lr": 3.112684872162703e-05, "train_loss": 3.135633980567983, "epoch": 273, "n_parameters": 11590840}
{"train_lr": 2.9654625589913e-05, "train_loss": 3.1337282281461283, "epoch": 274, "n_parameters": 11590840}
{"train_lr": 2.8234529435159726e-05, "train_loss": 3.1363811073043077, "epoch": 275, "n_parameters": 11590840}
{"train_lr": 2.686671598691151e-05, "train_loss": 3.1295296204842917, "epoch": 276, "n_parameters": 11590840}
{"train_lr": 2.55513352413271e-05, "train_loss": 3.123510868655597, "epoch": 277, "n_parameters": 11590840}
{"train_lr": 2.4288531444729954e-05, "train_loss": 3.126117390849226, "epoch": 278, "n_parameters": 11590840}
{"train_lr": 2.3078443077785567e-05, "train_loss": 3.1179381135603985, "epoch": 279, "n_parameters": 11590840}
{"train_lr": 2.192120284031953e-05, "train_loss": 3.12479729363196, "test_loss": 0.9034121544653428, "test_acc1": 79.43600257324219, "test_acc5": 94.82000252929687, "epoch": 280, "n_parameters": 11590840}
{"train_lr": 2.0816937636766537e-05, "train_loss": 3.1183729855229054, "epoch": 281, "n_parameters": 11590840}
{"train_lr": 1.976576856224742e-05, "train_loss": 3.1258816430918412, "epoch": 282, "n_parameters": 11590840}
{"train_lr": 1.8767810889299472e-05, "train_loss": 3.1130714507387887, "epoch": 283, "n_parameters": 11590840}
{"train_lr": 1.7823174055225238e-05, "train_loss": 3.1153017404339582, "epoch": 284, "n_parameters": 11590840}
{"train_lr": 1.6931961650100128e-05, "train_loss": 3.1178496638647943, "epoch": 285, "n_parameters": 11590840}
{"train_lr": 1.609427140540658e-05, "train_loss": 3.1148206856396556, "epoch": 286, "n_parameters": 11590840}
{"train_lr": 1.5310195183320784e-05, "train_loss": 3.1152974600962504, "epoch": 287, "n_parameters": 11590840}
{"train_lr": 1.4579818966635116e-05, "train_loss": 3.109842925179872, "epoch": 288, "n_parameters": 11590840}
{"train_lr": 1.3903222849333426e-05, "train_loss": 3.114564793251401, "epoch": 289, "n_parameters": 11590840}
{"train_lr": 1.3280481027803718e-05, "train_loss": 3.1118165004453497, "test_loss": 0.8950902612958375, "test_acc1": 79.62200249023438, "test_acc5": 94.91200266601562, "epoch": 290, "n_parameters": 11590840}
{"train_lr": 1.2711661792704427e-05, "train_loss": 3.1051136378785498, "test_loss": 0.9018061433943753, "test_acc1": 79.5720026953125, "test_acc5": 94.87200235351563, "epoch": 291, "n_parameters": 11590840}
{"train_lr": 1.2196827521475628e-05, "train_loss": 3.109409333704759, "test_loss": 0.9035136366826821, "test_acc1": 79.66800276855469, "test_acc5": 94.89800244140625, "epoch": 292, "n_parameters": 11590840}
{"train_lr": 1.1736034671495227e-05, "train_loss": 3.114151931566586, "test_loss": 0.9021170568648883, "test_acc1": 79.62600262207032, "test_acc5": 94.86000256835938, "epoch": 293, "n_parameters": 11590840}
{"train_lr": 1.1329333773893123e-05, "train_loss": 3.106837773280178, "test_loss": 0.8939082184057127, "test_acc1": 79.76000228027344, "test_acc5": 94.88600251953125, "epoch": 294, "n_parameters": 11590840}
{"train_lr": 1.0976769428005425e-05, "train_loss": 3.1012234592275747, "test_loss": 0.8985905765459455, "test_acc1": 79.67400261230469, "test_acc5": 94.93800259765625, "epoch": 295, "n_parameters": 11590840}
{"train_lr": 1.067838029648576e-05, "train_loss": 3.0921816407776563, "test_loss": 0.9033976534088909, "test_acc1": 79.68000258789063, "test_acc5": 94.942002734375, "epoch": 296, "n_parameters": 11590840}
{"train_lr": 1.0434199101065238e-05, "train_loss": 3.0970070224752626, "test_loss": 0.9022497556447069, "test_acc1": 79.7100026171875, "test_acc5": 94.92600255859375, "epoch": 297, "n_parameters": 11590840}
{"train_lr": 1.0244252618963044e-05, "train_loss": 3.10096681386018, "test_loss": 0.901076162112627, "test_acc1": 79.72600258789062, "test_acc5": 94.94600258789062, "epoch": 298, "n_parameters": 11590840}
{"train_lr": 1.0108561679951307e-05, "train_loss": 3.10676134603439, "test_loss": 0.9043267702690942, "test_acc1": 79.58200251953124, "test_acc5": 94.92000267578125, "epoch": 299, "n_parameters": 11590840}
LMMMEng commented 1 year ago

Many thanks!