pbiecek / InterpretableMachineLearning2020

Lecture notes for 'Interpretable Machine Learning' at WUT and UoW. Summer semester 2019/2020
27 stars 27 forks source link

Dataset: COMPAS Recidivism Algorithm #8

Open pbiecek opened 4 years ago

pbiecek commented 4 years ago

Problem

This is a regression problem. On the basis of historical data, models (of varying degrees of complexity) should be developed to predict the COMPAS scores. The best models should be explained using XAI tools at the instance level and at the data set level.

Data

COMPAS Recidivism Risk Scores are suspected of being biased https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm. Let's check this. The data can be downloaded from the propublica website. https://github.com/propublica/compas-analysis

Example solution

An interesting description of performed analysis can be found here: https://github.com/propublica/compas-analysis/blob/master/Compas%20Analysis.ipynb and here https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm

marta-generation commented 4 years ago

So much can be done using this flagship data set for ML fairness! If anyone's interested in this project - ping me on Slack, happy to support :)