TideDra / VL-RLHF

A RLHF Infrastructure for Vision-Language Models
Apache License 2.0
94 stars 5 forks source link

支持cogvlm2模型的强化学习训练吗 #12

Open kaka-Cao opened 3 months ago