Closed aditya1503 closed 2 weeks ago
Please provide 2 Colab notebook links so we can reproduce this.
To me, it looks like your 0.7 run was not using the same data generation logic, and did not include the noise. With the noise added, it should be impossible to achieve the score reported.
Additionally, the code provided does not use fixed seeds in both the generation and train/test split, making the scores not comparable.
Running with AutoGluon 1.1.0 I get good results:
R2 Score test: 0.9962127006040331
MSE test: 137.0034214286692
R2 Score train: 0.9916397365538735
MSE train: 493.6924675927423
Because you create a few outlier samples with much larger x/y, those dominate the loss calculations, and since the seed is not fixed, this results in the difference between runs.
Please re-open the issue if you still find major differences after resolving the above issues.
Bug Report Checklist
Describe the bug When running the regression task multiple times with the equation y = 2*x+5, Autogluon 1.1.0 consistently performs worse compared to Autogluon 0.7.0.
Expected behavior I expected Autogluon 1.1.0 to perform at least as well as Autogluon 0.7.0 on this simple mathematical function regression task. (with a high R2_score)
To Reproduce I've prepared code snippets to reproduce the issue. Please find them below. Generate dataset + run AutoGluon Tabular:
Screenshots / Logs AutoGluon 0.7.0's performance
AutoGluon 1.1.0's performance
Installed Versions