Open msakarvadia opened 2 months ago
Can we tell a model "I am [insert identity here]" and get it to misclassify us.
Can we tell a model "I am [insert identity here]" and get it to misclassify us.