Trusted-AI / AIF360

A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.
https://aif360.res.ibm.com/
Apache License 2.0
2.4k stars 827 forks source link

Corrected german credit data #325

Open nrkarthikeyan opened 2 years ago

nrkarthikeyan commented 2 years ago

The widely used german credit data (that is already available in the toolkit) apparently has coding errors, so consider including https://archive.ics.uci.edu/ml/datasets/South+German+Credit+%28UPDATE%29

http://www1.beuth-hochschule.de/FB_II/reports/Report-2019-004.pdf

nrkarthikeyan commented 1 year ago

Tasks:

Ricardo-OB commented 1 year ago

I was working on Colab and also ran into this error on the German Credit notebook, aif360 gave me instructions to download two files and move them to a folder. It was solved by running this code:

%pip install wget
import wget, os

output_directory = os.path.join("/usr/local/lib/python3.8/dist-packages/aif360/data/raw/german")

german_data_url = "https://archive.ics.uci.edu/ml/machine-learning-databases/statlog/german/german.data"
german_doc_url = "https://archive.ics.uci.edu/ml/machine-learning-databases/statlog/german/german.doc"

german_data = wget.download(german_data_url, out=output_directory)
german_doc = wget.download(german_doc_url, out=output_directory)