EpistasisLab / pmlb

PMLB: A large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms.
https://epistasislab.github.io/pmlb/
MIT License
805 stars 135 forks source link

Bug in Breast Cancer Wisconsin dataset #159

Open mrektor opened 3 years ago

mrektor commented 3 years ago

Hi everybody.

I found a small bug when fetching Breast Cancer Wisconsin dataset: The target class is not the last column but the first one.

This can be solved in code by simply querying for the "target" column, but for consistency I think should be fair to consider moving the target class to the last column.

trangdata commented 3 years ago

Thanks @mrektor for reporting this. We're open for PRs if you'd like to submit one to fix this issue. Thanks so much! 🙏🏽 🌻