jasonvanf llama-trl issues - Githubissues

jasonvanf / llama-trl

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

Apache License 2.0

185 stars 23 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

What's the advantage of this library compare to the official TRL?

#7 disperaller opened 3 months ago
1
多卡内存分配不均导致0号GPU内存溢出

#6 czx-li opened 4 months ago
0
tuning_lm_with_rl.py does not appear to have a file named config.json

#5 judyhappy opened 1 year ago
2
IndexError: index out of range in self on training_reward_model

#4 SeekPoint opened 1 year ago
0
tuning_lm_with_rl 有完整运行成功的案例嘛？

#3 jerry1993-tech opened 1 year ago
1
运行 train_llm_with_rl.py 是会在 ppo_trainer.generate(...) 报如下的错误，请教解决

#2 jerry1993-tech opened 1 year ago
0
fix: AutoTokenizer

#1 dkqkxx closed 1 year ago
1