-
在训练前,使用modelscope download --dataset swift/RLAIF-V-Dataset 命令下载了数据集,保存路径为/home/xxx/.cache/modelscope/datasets/swift/RLAIF-V-Dataset,刚刚才拉取了最新的swift main分支。运行脚本如下:
CUDA_VISIBLE_DEVICES=0 \
swift rlh…
-
Hello, Ashwinee Panda
I was very impressed with your work and wanted to thank you for the excellent contribution. I am currently following the tutorial using the openbookqa task to finally experime…
-
在用swift 做dpo的时候,使用[https://github.com/modelscope/ms-swift/blob/main/docs/source_en/Multi-Modal/human-preference-alignment-training-documentation.md](url) 官方的多模态demo,总是报错 KeyError: 'prompt',并且这个错误也出现在q…
-
A significant achievement in aligning Vision-Language Models!
While running the code 'RLAIF-V/muffin/train/train_llava15.py', I noticed that all model parameters are trainable. Due to hardware limi…
-
{
"id": "000000245946",
"image": "000000245946.jpg",
"conversations": [
{
"from": "human",
"value": "\nWhat considerations…
-
### System Info
- `transformers` version: 4.41.2
- Platform: Linux-5.10.0-1.0.0.28-x86_64-with-glibc2.31
- Python version: 3.10.14
- Huggingface_hub version: 0.24.6
- Safetensors version: 0.4.4…
-
- [ ] Why the author only compare RLAIF with RLHF on task of summarization?
- [ ] How are the performances for other tasks?
- [ ] For 4.1 Datasets, what other ways OpenAI use to filter the data?
- …
-
Hi, there~
After reading the parquets files of the RLAIF-V-Dataset downloaded from Hugging Face, I actually got 83k samples, which is significantly more than the "30k data" mentioned in the README…
-
In this issue, it is said that related code would be released at https://github.com/RLHF-V/RLAIF-V/issues/6, but I find this [link](https://github.com/RLHF-V/RLAIF-V#data-generation) is empty. Where c…
-
OpenAI used **40 people** when training their own chatGPT, and the annotation process lasted for **3 months**.
It is difficult for our open source community (github) to reproduce the **Reinforcemen…