RLHF-V / RLAIF-V

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
129 stars 5 forks source link

The LoRA training codes and scripts #11

Open darkpromise98 opened 3 days ago

darkpromise98 commented 3 days ago

A significant achievement in aligning Vision-Language Models!

While running the code 'RLAIF-V/muffin/train/train_llava15.py', I noticed that all model parameters are trainable. Due to hardware limitations, could you kindly provide the LoRA training codes, similar to LLaVA?

yiranyyu commented 2 days ago

Thank you for your interest!

We are currently engaged in developing the LoRA codes, please stay tuned!