rlhf Search Results - Githubissues

1000+ results
for rlhf

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

unslothai/unsloth #725

Does it support rloo_trainer of trl?

> [rank0]: Traceback (most recent call last): > [rank0]: File "/opt/tmp/nlp/wzh/LLM-Dojo/rlhf/rloo_train.py", line 167, in > [rank0]: trainer.train() > [rank0]: File "/home/nlp/miniconda3/…

mst272 updated 4 weeks ago
7
microsoft/DeepSpeed #3418

[BUG]KeyError: 'attention_mask'

run step3 with: deepspeed --master_port 12346 DeepSpeedExamples/applications/DeepSpeed-Chat/training/step3_rlhf_finetuning/main.py \ --data_path wangrui6/Zhihu-KOL \ --data_split 2,4,4 \ …

janglichao updated 1 year ago
3
microsoft/DeepSpeedExamples #344

Job hang at model forward for rank 0 after saving immediate …

I want to save immediate ckpt in training after specfic steps while keep meeting job hang issue, how can I got it fixed? Torch 1.14 + CUDA 12.0, Transformer Engine 0.6 Code ``` for step, batch in …

xiaolhu1224 updated 1 year ago
3
PKU-Alignment/safe-rlhf #20

[Feature Request] LoRA support for memory efficient fine-tun…

### Required prerequisites - [X] I have read the documentation . - [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/PKU-…

70557dzqc updated 8 months ago
2
TideDra/VL-RLHF #7

微调internXC2报错

File "/home/ma-user/anaconda3/envs/dpo/lib/python3.10/site-packages/transformers/configuration_utils.py", line 264, in __getattribute__ return super().__getattribute__(key) AttributeError: 'In…

yuzeng0-0 updated 4 months ago
9
huggingface/datasets #7037

A bug of Dataset.to_json() function

### Describe the bug When using the Dataset.to_json() function, an unexpected error occurs if the parameter is set to lines=False. The stored data should be in the form of a list, but it actually tur…

LinglingGreat updated 1 month ago
2
ssbuild/chatglm_rlhf #3

请问怎么获取rw和rlhf的训练数据

GUORUIWANG updated 1 year ago
1
ayulockin/T5-RLHF-TF #1

Is it possible to release the code based on Jax

Hi, very great repo! May I ask is it possible to release the code based on Jax? Best

sglucas updated 1 year ago
1
nebuly-ai/optimate #257

[Chatllama] Training Japanese language for finetuning or tra…

To the chatLLaMA team, Thank you very much for this nice project. I looked at the model file and saw that the comment of compatiblity with training, so I thought it would be possible to train with …

TakafumiYano updated 1 year ago
2
MTamon/BETA #1

ChatGPT

https://arxiv.org/abs/2203.02155

MTamon updated 1 year ago
2

上一页 1...21 22 23 24 25 26 27...100 下一页

1000+ results for rlhf

1000+ results
for rlhf