rlaif Search Results - Githubissues

71 results
for rlaif

RLHF-V/RLAIF-V #34

Error when initing

I followed the initialization instructions:
```
git clone https://github.com/RLHF-V/RLAIF-V.git
cd RLAIF-V
conda create -n rlaifv python=3.10 -y
conda activate rlaifv
pip install -e .
wget https://gith…
```

pspdada updated 2 days ago
2
RLHF-V/RLAIF-V #30

Issues for Implementation

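Hello, very meaningful work! When I trained on the LLaVA 1.5 data from the dataset you provided, I found that on the Object Hallucination benchmark, full fine-tuning scores 19.13 and 9.32, and LoRA fine-tuning scores 10.07 and 5.30, far below the performance of your model fine-tuned with the online strategy. Is there a strategy, or a particular set of data, that would get close to your performance?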

injadlu updated 1 week ago
3
modelscope/ms-swift #2133

infer, sft, rlhf support for LLama3.2-Vision

Environment preparation:
```
git clone https://github.com/modelscope/ms-swift.git
cd ms-swift
pip install -e .[llm]
# or pip install git+https://github.com/modelscope/swift.git#egg=ms-sw…
```

Jintao-Huang updated 4 days ago
4
kiddyboots216/lottery-ticket-adaptation #5

No requirements.txt in the rlaif

Currently, there's no requirements.txt file in the rlaif folder (as described per instructions in the README)

JRestom updated 1 week ago
1
RLHF-V/RLAIF-V #32

where to find the lora fine-tuned checkpoints?

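Very meaningful work, thank you very much. In the latest commit, the model loaded in chat.py, 'RLAIF-V/RLAIF-V-7B/lora_checkpoints', cannot be found; on HF I only found https://huggingface.co/openbmb/RLAIF-V-7B/tree/main. Could you provide the latest model? Thanks~ @yiranyyu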

lufanma updated 2 days ago
2
kiddyboots216/lottery-ticket-adaptation #2

About mask generating and adaptation

Hello, Ashwinee Panda. I was very impressed with your work and wanted to thank you for the excellent contribution. I am currently following the tutorial using the openbookqa task to finally experime…

HeeseongEom updated 3 months ago
2
Liang-Jiaying/RLAIF #3

Questions to research and think

- [ ] Why does the author only compare RLAIF with RLHF on the task of summarization?
- [ ] How are the performances for other tasks?
- [ ] For 4.1 Datasets, what other ways does OpenAI use to filter the data?
- …

Liang-Jiaying updated 1 year ago
1
RLHF-V/RLAIF-V #29

Training bugs - UnboundLocalError: local variable 'df' refer…

Hi. While reproducing the training process, a bug is reported as: File "/mnt/bn/kinetics-lp-maliva/playground_projects/RLAIF-V_bak/./muffin/train/train_llava15.py", line 279, in init_model …

youthHan updated 3 days ago
2
yonseivnl/vlm-rlaif #10

Trying Inference on TempCompass benchmark

I am getting "" as output and can't figure out the issue.
```
import torch
from videollava.conversation import conv_templates, SeparatorStyle
from videollava.model.builder import load_pre…
```

yogkul2000 updated 1 month ago
1
lucidrains/PaLM-rlhf-pytorch #23

✨ 😅 Is it possible to use the ChatGPT of OpenAI to train this…

OpenAI used **40 people** when training their own ChatGPT, and the annotation process lasted for **3 months**. It is difficult for our open source community (GitHub) to reproduce the **Reinforcemen…

Yonv1943 updated 1 year ago
8
