unitaryai / detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
https://www.unitary.ai/
Apache License 2.0

Issues when training unintended bias model #110

Closed MLRadfys closed 5 months ago

MLRadfys commented 5 months ago

Hi and thanks for this great repository!

I tried to fine-tune the RoBERTa model on my own for unintended bias in comment classification. Unfortunately, all of the subgroup AUCs are close to 0.5.

I used the standard configuration given in this repo.

Has anyone tried fine-tuning the models on their own?

I also evaluated the unbiased model given in the repo, leading to a similar result:

[screenshot of evaluation results]

Thanks in advance,

/M

EDIT:

Hi again!

My fault --> I ran the model on public_test.csv but computed the bias AUCs against the labels in private_test.csv :-( Now it is working :-)
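For anyone hitting the same symptom: subgroup AUCs pinned near 0.5 are exactly what you get when predictions and labels come from different files, because the rows no longer line up. A minimal self-contained sketch (nothing below is from the repo; the file names in the comments just echo the ones mentioned above):

```python
# Illustrative sketch: pairing a model's scores with labels from a
# *different* file destroys row alignment, which collapses AUC to ~0.5
# no matter how good the model is.
import random

def auc(labels, scores):
    """AUC = probability that a random positive is ranked above a random negative."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

random.seed(0)
n = 2000
# stand-in for public_test.csv labels, and an informative model scored on those rows
labels = [int(random.random() < 0.2) for _ in range(n)]
scores = [0.3 * y + 0.7 * random.random() for y in labels]
# stand-in for private_test.csv labels: same distribution, but unrelated rows
other_labels = [int(random.random() < 0.2) for _ in range(n)]

print(auc(labels, scores))        # well above 0.5: labels match the scored rows
print(auc(other_labels, scores))  # near 0.5: labels came from a different file
```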

/M

jamt9000 commented 5 months ago

Glad you got it working!