-
A significant achievement in aligning Vision-Language Models!
While running the code 'RLAIF-V/muffin/train/train_llava15.py', I noticed that all model parameters are trainable. Due to hardware limi…
-
Hi,
We have recently released our latest work, `RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness`, on [GitHub](https://github.com/RLHF-V/RLAIF-V) and [arXiv…
-
Thanks for your wonderful work.
When I tried to load the dataset, an error occurred. However, the data extracting process goes well.
How to fix it?
OSError: Cannot find data file.
Original erro…
-
Hi I am getting this error loading the DPO dataset, does anyone know how to resolve it? Thank you!
I have this error even when my pandas version is 2.2.2
> >>> pd.read_parquet("code/eagle-dev/R…
-
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
https://arxiv.org/abs/2405.17220
-
https://github.com/RLHF-V/RLAIF-V/blob/main/muffin/data/data_processors.py#L97
The function is not loaded or defined.
Also, gather_data_files_by_glob function may not match the parquet format of o…
-
非常感谢您的开源,有问题想请教:
![image](https://github.com/RLHF-V/RLAIF-V/assets/30074778/e27abcdd-26a0-4938-9647-cf4f3dd53613)
请问一下ref_win_logp这些是标注里面存的预先算出来的吗?RLAIF-V-Dataset里面貌似没有看到呢,有直接可用的数据可以参考吗?感谢
-
When creating a PEFT model and then trying to train it, we get an error;
```
File "/scratch/gpfs/ashwinee/unsloth/unsloth/kernels/fast_lora.py", line 106, in backward
d_do…
-
Hi 2 quick questions,
1. From the paper algorithm1, I get a sense that the algorithm can work in an online divide-n-conquer manner with updated model and I am just curious when the self-feedback co…
-
## Why
推薦・機械学習勉強会は、推薦や機械学習、その周辺技術を通じてサービスを改善することにモチベーションのある人達の集まりです。ニュースやブログから論文まで、気になったものについてお互い共有しましょう!
発信のため、ここは **public** にしてあります。外部からの参加をご希望の方は樋口(https://twitter.com/zerebom_3) まで DM を送るか、…