Trusted-AI / AIF360

A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.
https://aif360.res.ibm.com/
Apache License 2.0
2.42k stars 827 forks source link

Add support for diabetes dataset #348

Open anupamamurthi opened 2 years ago

anupamamurthi commented 2 years ago

dataset location: https://archive.ics.uci.edu/ml/datasets/diabetes+130-us+hospitals+for+years+1999-2008

Potential Tasks:

hakimamarullah commented 2 years ago

Hi, @anupamamurthi. I am new to contributing to open source. I would like to work on this one.

hakimamarullah commented 2 years ago

Hi, @anupamamurthi. I've got some troubles implementing this feature. standardize_dataset returned 0 row but kept the columns. here is the test result:

expected dataset.X.shape to be (101766, 48) actual (0, 48)

I thinks it's because of dropna set to True, but when I changed dropna to False, it returned (101766, 13). How do I fix this ?

hakimamarullah commented 1 year ago

Potential task check: