-
### Required prerequisites
- [X] I have read the documentation.
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/PKU-…
-
Failed to run the evaluation script.
-
### Required prerequisites
- [X] I have read the documentation.
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/P…
-
I would like to read and modify your code. Do you have any suggestions? Could you explain what the classes defined in safe-rlhf mean, such as `AutoModelForScore` and `PreferenceDataset`? What's more, …
-
Hello,
I would like to ask how to create an evaluation dataset.
When I directly run `python evaluate_generation_model.py --model_path ../../LLM_Models/poison-7b-SUDO- --token SUDO --report_path ./…
-
#### Describe the issue linked to the documentation
We started with one notebook per dataset (in the doc/code/orchestrators directory) and now it's becoming a lot. Since they all pretty much fo…
-
### Required prerequisites
- [X] I have read the documentation.
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/PKU-…
-
# URL
- https://arxiv.org/abs/2307.04964
# Affiliations
- Rui Zheng, N/A
- Shihan Dou, N/A
- Songyang Gao, N/A
- Wei Shen, N/A
- Binghai Wang, N/A
- Yan Liu, N/A
- Senjie Jin, N/A
- Qi…
-
**Describe the bug**
When I run training for RLHF step 3:
```
Actor_Lr=9.65e-6
Critic_Lr=5e-6
#--data_path Dahoas/rm-static \
#--offload_reference_model \
deepspeed --master_port 12346 main_step3.py…