msakarvadia / llm_bias

Investigating if we can find circuits in LLMs that reinforce human-biases found in training data
MIT License
0 stars 0 forks source link