TideDra / VL-RLHF

An RLHF Infrastructure for Vision-Language Models
Apache License 2.0
77 stars, 4 forks