-
Failed to run the evaluation script.
-
Hello,
I would like to ask how to create an evaluation dataset.
When I directly run `python evaluate_generation_model.py --model_path ../../LLM_Models/poison-7b-SUDO- --token SUDO --report_path ./…
-
Despite enabling an 8K context window in ChatterUI, longer prompts are not being forwarded to the local API.
This issue suggests a potential limitation within ChatterUI's prompt handling, preventin…
-
### Bug Report
I have tried to reproduce the results on my own using Llama 3.1 8b.
I can successfully run the SFT and Reward models trainers. But, the cost model trainer consistently crashes.
…
-
### System Info
Ubuntu 22.04 all latest versions
### Who can help?
@BenjaminBossan @sayakpaul
### Information
- [ ] The official example scripts
- [x] My own modified scripts
### Ta…
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/PKU-…
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/PKU-…
-
**Describe the bug**
when i run train,rlhf step 3;
```
Actor_Lr=9.65e-6
Critic_Lr=5e-6
#--data_path Dahoas/rm-static \
#--offload_reference_model \
deepspeed --master_port 12346 main_step3.py…
-
> We preprocess many open-source preference datasets into the standard format and upload them to the hugginface hub. You can find them [HERE](https://huggingface.co/collections/RLHFlow/standard-format…
-
# URL
- https://arxiv.org/abs/2307.04964
# Affiliations
- Rui Zheng, N/A
- Shihan Dou, N/A
- Songyang Gao, N/A
- Wei Shen, N/A
- Binghai Wang, N/A
- Yan Liu, N/A
- Senjie Jin, N/A
- Qi…