Closed hengzhe-zhang closed 1 year ago
Hey @hengzhe-zhang !
For cross-validation, GP-GOMEA uses the negative mean squared error: neg_mse = - MEAN_i[ (y_i - p_i)**2 ]
. This is because sklearn
wants a scoring metric where higher is better.
If I remember right, this is (or maybe was) a standard choice for scoring regression models in sklearn
.
So, to convert the scores to mean squared errors, you can multiply by -1. Do the numbers look right to you if you do that? Or do you think the numbers are still strange for this dataset?
Wow. Thanks a lot. I have changed the scoring function, and it works well now.
I found a very weird phenomenon in the cross-validation mode. Using the cross-validation function in scikit-learn will make GP-GOMEA perform very badly. I believe it is a bug. Are you willing to have a check on this issue? Thanks!
Here is a minimal example: