Closed faseelmo closed 10 months ago
Train for 100 Epochs using MAE (L1) then use that weights to train on MSE (L2)
MAE and RMSE giving pretty much the same results.
Once MAE reaches its local minima try training using that weights and lowering the learning rate.
Model is overfitting if we train further than 200 Epoch
Train for 100 Epochs using MAE (L1) then use that weights to train on MSE (L2)