-
Our team [KABasalt](https://github.com/BASALT-2022-Karlsruhe) participated in last year's BASALT competition and we noticed that RLHP currently lacks support for human preferences.
## Problem:
On…
-
https://huggingface.co/datasets/stanfordnlp/SHP This may best be one of the best first datasets in the training of a model
-
### Client Version:
515.1642
### Issue Summary:
You can no longer select your ear type in the character preferences for felinids. The option is gone.
![image](https://github.com/user-attac…
-
Allow player to say what they'd prefer to play as (Zombie / Survivor) and let that weighing occur
-
### Required prerequisites
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-sora/issues) and [Discussions](https://github.com/PKU-Alignment/safe-sora/discussions) tha…
-
Hi there,
Thank you for bringing the elegant RAG Assessment framework to the community.
I am an AI engineer from Alibaba Cloud, and our team has been fine-tuning LLM-as-a-Judge models based on t…
-
[[1] Fine-Tuning Langauage Models from Human Preferences.pdf](https://github.com/justlikeazoo/Paper-Review/files/12842664/1.Fine-Tuning.Langauage.Models.from.Human.Preferences.pdf)
-
- Same Old Songs:
• Tired of hearing the same popular tracks? So are we.
- Off-Target Suggestions:
• Ever feel like the recommendations just don’t get you?
- Genre Tunnel Vision:
• Discovering ne…
-
Thank you very much @WuTheFWasThat
-
Hi, Thank you for your work. I want to ask where I can find the code of "Improving Generalization of Alignment with Human Preferences through Group Invariant Learning"