kingoflolz / mesh-transformer-jax

Model parallel transformers in JAX and Haiku
Apache License 2.0
6.26k stars 890 forks source link

The PILE dataset is full of racist content and thus GPT-J produces racist thinking. #240

Open azeemh opened 1 year ago

azeemh commented 1 year ago

Any way to train on a different dataset?

Joseffb commented 1 year ago

I too have a question on how to train the dataset, specifically can we set GPT-J to learn from it's interactions?

havietisov commented 1 year ago

Model card mentions this : https://huggingface.co/EleutherAI/gpt-j-6b#out-of-scope-use