issues
search
mindspore-lab
/
mindrlhf
Apache License 2.0
26
stars
12
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update README_CN.md
#92
ChessQian
closed
1 day ago
0
mod qwen2 dpo readme
#91
coder-yuzhiwei
closed
1 day ago
0
code clean
#90
ygdl0228
opened
2 days ago
0
add qwen2 dpo
#89
coder-yuzhiwei
closed
1 day ago
1
fix baichuan2 dpo error
#88
ChessQian
closed
2 days ago
0
add dpo tutorial
#87
ChessQian
closed
1 month ago
1
support baichuan2 dpo offline
#86
ChessQian
closed
1 month ago
1
【bug】AttributeError: module 'mindrlhf.models.baichuan2' has no attribute 'B'
#85
ChessQian
opened
1 month ago
0
Update base_model.py
#84
ygdl0228
closed
2 months ago
0
Update test_actor_inference.py
#83
ygdl0228
closed
1 month ago
0
Update llama2_7b.yaml
#82
ygdl0228
closed
1 month ago
0
Update configs.py
#81
ygdl0228
closed
1 month ago
0
Update configs.py
#80
ygdl0228
closed
1 month ago
0
Update wrapper.py
#79
ygdl0228
closed
1 month ago
0
Update test_actor_inference.py
#78
ygdl0228
closed
1 month ago
0
ygdl0228
#77
ygdl0228
closed
2 months ago
1
Update llama2_7b.yaml
#76
ygdl0228
closed
2 months ago
0
Update run_llama_2_7b_rm.yaml
#75
ygdl0228
closed
2 months ago
1
fix run_llama_2_7b_rm.yaml
#74
ChessQian
closed
2 months ago
1
Update reademe and refactor reward_eval.py
#73
xhw035
closed
7 months ago
0
LLAMA2在Nvidia GPU 上无法运行
#72
zhz44
closed
7 months ago
1
make_experience.py和ppo_trainer.py内容一样
#71
LKLKyy
closed
7 months ago
1
Readme, Script issue fix
#70
MashiroChen
closed
7 months ago
0
adapt mindspore 2.3
#69
xhw035
closed
7 months ago
0
update llama_reward_model_tutorial.md and fix bugs
#68
xhw035
closed
8 months ago
0
Fix gpt2+rlhf and add st
#67
MashiroChen
closed
8 months ago
0
什么时间能增加qwen-7b 、qwen-14b rlhf开源
#66
wangyao123456a
closed
1 day ago
4
add baichuanreward in reward eval and infer
#65
ChessQian
closed
8 months ago
0
change readme
#64
ChessQian
closed
8 months ago
0
add baichuan2 7b reward and ppo
#63
ChessQian
closed
8 months ago
0
Init mindrlhf tests and add gpt2 st
#62
MashiroChen
closed
8 months ago
0
GPU memory is not enough when training LLaMA2-7B PPO
#61
dhcode-cpp
closed
7 months ago
1
Update README.md in reward model
#60
ChessQian
closed
8 months ago
0
Not found comparison_dataset.py
#59
May-Z-H
closed
9 months ago
0
Create llama_reward_model_tutorial.md
#58
ChessQian
closed
8 months ago
0
Run reward model train example failed
#57
dhcode-cpp
closed
8 months ago
3
Add pfa/fas and modify run scripts
#56
MashiroChen
closed
9 months ago
0
Fix GPT2 Llama2 incremental infer bug
#55
MashiroChen
closed
9 months ago
0
add llama2 reward model
#54
xhw035
closed
9 months ago
0
Fix llama2 and modify training scripts
#53
MashiroChen
closed
10 months ago
2
single prompt being used in training
#52
kfertakis
closed
1 day ago
1
Add gpt2 related file and bug fix
#51
MashiroChen
closed
10 months ago
0
fix bug in save checkpoints
#50
ChessQian
closed
10 months ago
0
model.phase problem
#49
ChessQian
opened
10 months ago
0
del use past in ppoconfigs
#48
ChessQian
closed
10 months ago
0
del use past in ppoconfigs
#47
ChessQian
closed
10 months ago
0
add align type in configs
#46
ChessQian
closed
10 months ago
0
add align type in configs
#45
ChessQian
closed
10 months ago
0
fix gpt in model list
#44
ChessQian
closed
10 months ago
0
fix gpt in model list
#43
ChessQian
closed
10 months ago
1
Next