rlaif-v Search Results - Githubissues

RLHF-V/RLAIF-V #11

The LoRA training codes and scripts

A significant achievement in aligning Vision-Language Models! While running the code 'RLAIF-V/muffin/train/train_llava15.py', I noticed that all model parameters are trainable. Due to hardware limi…

darkpromise98 updated 2 weeks ago

showlab/Awesome-MLLM-Hallucination #4

Update RLAIF-V

Hi, We have recently released our latest work, `RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness`, on [GitHub](https://github.com/RLHF-V/RLAIF-V) and [arXiv…

yiranyyu updated 1 month ago

RLHF-V/RLAIF-V #13

Error when loading datasets split

Thanks for your wonderful work. When I tried to load the dataset, an error occurred. However, the data extracting process goes well. How to fix it? OSError: Cannot find data file. Original erro…

Xuchen-Li updated 1 day ago

RLHF-V/RLAIF-V #5

Error loading the parquet dataset

Hi I am getting this error loading the DPO dataset, does anyone know how to resolve it? Thank you! I have this error even when my pandas version is 2.2.2 > >>> pd.read_parquet("code/eagle-dev/R…

charismaticchiu updated 1 month ago

mengdi-li/awesome-RLAIF #1

Add "RLAIF-V: Aligning MLLMs through Open-Source AI Feedback…

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness https://arxiv.org/abs/2405.17220

dschaehi updated 1 month ago

RLHF-V/RLAIF-V #3

dpo_preference_processor not defined

https://github.com/RLHF-V/RLAIF-V/blob/main/muffin/data/data_processors.py#L97 The function is not loaded or defined. Also, gather_data_files_by_glob function may not match the parquet format of o…

RifleZhang updated 1 month ago

RLHF-V/RLAIF-V #1

ref_win_logp

非常感谢您的开源，有问题想请教： ![image](https://github.com/RLHF-V/RLAIF-V/assets/30074778/e27abcdd-26a0-4938-9647-cf4f3dd53613) 请问一下ref_win_logp这些是标注里面存的预先算出来的吗？RLAIF-V-Dataset里面貌似没有看到呢，有直接可用的数据可以参考吗？感谢

buptlihang updated 1 month ago

unslothai/unsloth #320

Lora downcasting issue

When creating a PEFT model and then trying to train it, we get an error; ``` File "/scratch/gpfs/ashwinee/unsloth/unsloth/kernels/fast_lora.py", line 106, in backward d_do…

kiddyboots216 updated 2 months ago

RLHF-V/RLAIF-V #6

Self feedback data generation pipeline & reference model

Hi 2 quick questions, 1. From the paper algorithm1, I get a sense that the algorithm can work in an online divide-n-conquer manner with updated model and I am just curious when the self-feedback co…

charismaticchiu updated 2 weeks ago

wantedly/machine-learning-round-table #220

[2023/11/15]推薦・機械学習勉強会

## Why 推薦・機械学習勉強会は、推薦や機械学習、その周辺技術を通じてサービスを改善することにモチベーションのある人達の集まりです。ニュースやブログから論文まで、気になったものについてお互い共有しましょう！発信のため、ここは **public** にしてあります。外部からの参加をご希望の方は樋口(https://twitter.com/zerebom_3) まで DM を送るか、…

zerebom updated 8 months ago

12 results for rlaif-v

12 results
for rlaif-v