mljar / mljar-supervised

Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
https://mljar.com
MIT License
2.97k stars 392 forks source link

leaderboard is showing negative values for f1 metric #580

Open yairVanti opened 1 year ago

yairVanti commented 1 year ago

when i run fit on tabular binary classification dataset i get the following leader board image

when i run model.report() it looks better , where the metrics are positive:

image

pplonski commented 1 year ago

Looks like a bug. Thank you for reporting!

aksy2512 commented 1 year ago

Hi, I want to work on this issue

pplonski commented 1 year ago

Hi @aksy2512, thanks, it is all yours!

The first step would be to reproduce the issue. Are you able to reproduce it locally with some fake data?

aksy2512 commented 1 year ago

Thank you @pplonski for assigning me the issue. I am not able to reproduce the data locally. It would be great if you could suggest me some resources about it

pplonski commented 1 year ago

Thanks @aksy2512 for looking into it.

@yairVanti could you please provide example data and code to reproduce the issue locally?

yairVanti commented 1 year ago

dataset is attached, target is the last column. electrical_grid.csv

just train AutoML with these params : ("total_time_limit": 120, "ml_task": "binary_classification", "mode": "Perform", "eval_metric": "f1", "n_jobs": -1, "random_state": 0) and get_leaderboard() on the fitted AutoML instance.

pplonski commented 1 year ago

Thank you @yairVanti!