Datasets + Modele cu bias + Metode de interpretare + POC

tudorcebere commented 4 years ago

Ideal, ar trebui luata informatia relevanta de aici pentru a vedea PoC-urile valoroase pe care am putea sa le facem si pe ce: https://github.com/pbiecek/xai_resources

[x] Ce dataset-uri cu bias exista? Daca nu avem unul public, putem genera unul?
[ ] Exista algoritmi cu bias?
[ ] PoC de interes?

andreimano commented 4 years ago

Mă pot ocupa eu de asta

bcebere commented 4 years ago

There are a lot more hot topics surrounding the bias in ML, we need to have a why to detect them first:

How would you use John Oliver's episode for that? How are we helping?

Searching twitter for "Machine learning bias" returns several interesting stories:

80 Million Tiny Images dataset controversy: Can we detect that the training "overfits" certains traits? https://www.theregister.com/2020/07/01/mit_dataset_removed/
PULSE model controversy: can we spot this scenario? https://www.theverge.com/21298762/face-depixelizer-ai-machine-learning-tool-pulse-stylegan-obama-bias ?
ImageNet bias issues https://hyperallergic.com/518822/600000-images-removed-from-ai-database-after-art-project-exposes-racist-bias/
COMPAS algorithm issue : "ProPublica discovered that the COMPAS algorithm was able to predict the particular tendency of a convicted criminal to reoffend. However, with COMPAS, black offenders were evaluated as almost twice as likely as white offenders to be labeled a higher risk but not actually reoffend. On the other hand, white offenders were more often labeled as lower risk of reoffending than black offenders, despite their criminal history."
Can we review Credit-Score Algorithms? https://www.gsb.stanford.edu/insights/big-data-racial-bias-can-ghost-be-removed-machine
in a study late last year by the National Institute of Standards and Technology (NIST), researchers found evidence of racial bias in nearly 200 facial recognition algorithms. https://www.nist.gov/news-events/news/2019/12/nist-study-evaluates-effects-race-age-sex-face-recognition-software
Can we spot issues in the most popular apps, that our method could have prevented? https://news.gallup.com/poll/228497/americans-already-using-artificial-intelligence-products.aspx
Can we detect Sampling bias?

"A sampling bias happens when data is collected in a manner that oversamples from one community and under samples from another. This might be intentional or unintentional. The result is a model that is overrepresented by a particular characteristic, and as a result is weighted or biased in that way. The ideal sampling should either be completely random or match the characteristics of the population to be modeled."

Can we detect Measurement bias?

? "Measurement bias is the result of not accurately measuring or recording the data that has been selected. For example, if you are using salary as a measurement, there might be differences in salary including bonus or other incentives, or regional differences in the data. Other measurement bias can result from using incorrect units, normalizing data in incorrect ways or miscalculations."

Can we detect Exclusion bias ?

" exclusion bias arises from data that is inappropriately removed from the data source. When you have petabytes or more of data, it's tempting to select a small sample to use for training, but when doing so you might be inadvertently excluding certain data, resulting in a biased data set. Exclusion bias can also occur due to removing duplicates from data when the data elements are actually distinct."

Can we detect "Experimenter or observer bias"?

" the act of recording data itself can be biased. When recording data, the experimenter or observer might only record certain instances of data, skipping others. Perhaps you're creating a machine learning model based on sensor data but only sampling every few seconds, missing key data elements. Or there is some other systemic issue in the way that the data has been observed or recorded. In some instances, the data itself might even become biased by the act of observing or recording that data, which could trigger behavioral changes."

Can we detect "Prejudicial bias" ?

"data might become tainted by bias based on human activities that under-selected certain communities and over-selected others. When using historical data to train models, especially in areas that have previously been rife with prejudicial bias, care should be taken to make sure new models don't incorporate that bias."

Can we detect "Confirmation bias" ?

"Confirmation bias is the desire to select only the information that supports or confirms something you already know, rather than data that might suggest something that runs counter to preconceived notions. The result is data that is tainted because it was selected in a biased manner or because information that doesn't confirm the preconceived notion is thrown out."
Can we detect Bandwagoning?

"The bandwagon effect is a form of bias that happens when there is a trend occurring in the data or in some community. As the trend grows, the data supporting that trend increases and data scientists run the risk of overrepresenting the idea in the data they collect. Moreover, any significance in the data may be short-lived: The bandwagon effect could disappear as quickly as it appeared."

There are confs on the topic too https://events.drupal.org/global2020/sessions/combatting-bias-machine-learning

andreimano commented 4 years ago

** Note: I deleted the previous comments and centralized everything (in english) in this one. This comment will be further updated.

(0. DISCUSSION) In 1 the authors are identifying 23 types of biases. An observation about the possibility that they could be intertwined is also made (fig. 2). A distinction about the type of bias detection is made in 1, where the authors categorize bias detection into "pre-processing", "in-processing" and "post-processing". I think that we should focus on the "in-processing" and "post-processing" methods - should a data bias detection technique be included in a Deep Learning library, or should it be decoupled from it? Maybe the detection of bias of a trained model makes more sense (you implicitly get hints about the dataset bias).

(1. DATA) We could use the following datasets for training models and identifying bias:

[General] 1.1 The ProPublica COMPAS Recidivism Risk Score Data 2 (18.610 entries) - the dataset used in the ProPublica "Machine Bias" story 3.
[General] 1.2 Recidivism in juvenile justice dataset 8 (4.753 entries) - the dataset is fairly small but we can overfit it with some deep methods to make observations.
[Vision] 1.3 The Diversity in Faces 7 dataset - contains 1 million facial images, can be used for facial recognition or characteristic discrimination.
[NLP] 1.4 WinoBias 9 (3.160 sentences) - coreference resolution dataset, contains references to people using a vocabulary of 40 occupations. It contains two types of challenge sentences that require linking gendered pronouns to either male or female stereotypical occupations 9.

(2. MODELS) We can use and analyse the following (deep) models:

[General] 2.1 Pretrained attention-based models: attention can be a good tool for getting hints and interpreting the decisions made by a model. Attention visualizations are usually some kind of "heat maps", and can the mechanism is often being used in Vision 10 and NLP (any transformer, Seq2Seq RNN models using attention). In Vision and NLP we can use models that have some kind of pre-trained CNN backbone 13 or LM backbone 12 and fine-tune them on some downstream task 11.
[CV] 2.2 (model with obvious bias - TO DO -- disentanglement (AE/VAE/GAN)? deep feature extraction using pretrained classifiers?)
[NLP] 2.3 (model with obvious bias - TO DO -- word embeddings? deep feature extraction using LMs? coreference resolution?)

(3. INTERPRETABILITY) Methods for detecting bias in deep models:

Also see: #1 and #21

[General] 3.1 Attention, see the discussion at 2.1.
[Computer Vision] 3.2 (TO DO -- disentanglement, saliency?)
[NLP] 3.3 (TO DO)

/ https://scholar.google.be/citations?user=EuFF9kUAAAAJ&hl=nl ; https://www.cs.toronto.edu/~toni/Papers/icml-final.pdf ; https://papers.nips.cc/paper/9603-on-the-fairness-of-disentangled-representations ; https://arxiv.org/pdf/1908.09635.pdf ; /

(4. PROOF OF CONCEPT) TO DO

(5. MOTIVATION) As for the motivation for the work, we can cite the following resources:

Papers:

Sex Bias in Graduate Admission: Data from Berkley 4: an relatively old paper that examines the pattern of bias against female applicants to the University of California, Berkley. This is an example for Simpson's Paradox 5 and the Sure-Thing principle 6.

Media:

ProPublica's "Machine Bias" story 3: a critique on the COMPAS algorithm.
PULSE model controversy 14.
Credit-score assignment 15.
Facial recognition, the clearview.ai controversy 16

medtorch / Q-Aid-Core

Datasets + Modele cu bias + Metode de interpretare + POC #4