TideDra / VL-RLHF

A RLHF Infrastructure for Vision-Language Models
Apache License 2.0
100 stars 6 forks source link