-
I followed the initialization instructions:
```
git clone https://github.com/RLHF-V/RLAIF-V.git
cd RLAIF-V
conda create -n rlaifv python=3.10 -y
conda activate rlaifv
pip install -e .
wget https://gith…
```
-
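Hello, this is very meaningful work!
When I trained on the LLaVA 1.5 data from the dataset you provided, I got the following results on the Object Hallucination benchmark:
Full fine-tuning: 19.13 and 9.32
LoRA fine-tuning: 10.07 and 5.30
These are far lower than the performance of your model fine-tuned with the online strategy. Is there a strategy or a particular dataset that would let me get close to your results?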
-
Environment preparation:
```
git clone https://github.com/modelscope/ms-swift.git
cd ms-swift
pip install -e '.[llm]'
# or
pip install git+https://github.com/modelscope/swift.git#egg=ms-sw…
```
-
Currently, there is no requirements.txt file in the rlaif folder, although the instructions in the README refer to one.
-
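Very meaningful work, thank you very much. In the latest commit, chat.py loads the model 'RLAIF-V/RLAIF-V-7B/lora_checkpoints', which I could not find; on HF I only found https://huggingface.co/openbmb/RLAIF-V-7B/tree/main.
Could you provide the latest model? Thanks! @yiranyyu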
-
Hello, Ashwinee Panda
I was very impressed with your work and wanted to thank you for the excellent contribution. I am currently following the tutorial using the openbookqa task to finally experime…
-
Hi, while reproducing the training process, I hit the following error:
File "/mnt/bn/kinetics-lp-maliva/playground_projects/RLAIF-V_bak/./muffin/train/train_llava15.py", line 279, in init_model …
-
- [ ] Why do the authors compare RLAIF with RLHF only on the summarization task?
- [ ] How does it perform on other tasks?
- [ ] For Section 4.1 (Datasets), what other methods did OpenAI use to filter the data?
- …
-
I am getting an empty string ("") as output and can't figure out why.
```
import torch
from videollava.conversation import conv_templates, SeparatorStyle
from videollava.model.builder import load_pre…
```
-
OpenAI used **40 people** when training ChatGPT, and the annotation process lasted **3 months**.
It is difficult for our open source community (github) to reproduce the **Reinforcemen…