Trusted-AI / AIF360

A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.
https://aif360.res.ibm.com/
Apache License 2.0
2.46k stars 840 forks source link

Download LawSchool dataset directly from SEAPHE #359

Closed hoffmansc closed 9 months ago

hoffmansc commented 2 years ago

http://www.seaphe.org/databases.php

This way we can remove the dependency on tempeh. We can essentially copy this file (preserving the copyright notice): https://github.com/microsoft/tempeh/blob/main/tempeh/datasets/seaphe_datasets.py

See also meps_datasets.py for another example of downloading/unzipping.

Relevant files: tempeh_datasets.py law_school_gpa_dataset.py

See demo_grid_search_reduction_regression_sklearn.ipynb for example usage.

Behavior should be essentially the same as tempeh except dropping of NAs can be handled later so these should be kept.

anupamamurthi commented 2 years ago

Possible Tasks:

EktaBhaskar commented 1 year ago

please assign me this issue.

vandanapathare commented 1 year ago

Can I get this issue assigned