facebookresearch / rlfh-gen-div

This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
Other
32 stars 3 forks source link

Update README.md with links to eval datasets #1

Closed RobertKirk closed 7 months ago