-
Dear authors,
Thanks for your promising work. I am trying to fine-tune LLaVA-OV on my own datasets, and I modified `finetune_onevision.sh` as follows:
```
export OMP_NUM_THREADS=8
export NCCL_IB…
```
-
torchrun --nnodes=1 --nproc_per_node=8 --master_port=25001 \
llava/train/train_mem.py \
--model_name_or_path /path/to/checkpoint_llava_med \
--data_path /path/to/your_dental_dataset.jso…
-
When I fine-tune with LoRA, the model does not converge well. The hyperparameters are set as follows:
--lora_enable True \
--deepspeed scripts/zero3.json \
--model_name_or_path …
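One thing worth checking when LoRA training fails to converge is the effective update scale, `lora_alpha / lora_r`, which multiplies every low-rank update. A quick arithmetic sketch (not the repo's code; values taken from the flags above):

```python
# LoRA applies W_eff = W + (lora_alpha / lora_r) * (B @ A),
# so with --lora_r 128 --lora_alpha 256 every adapter update is
# scaled by 2.0; halving lora_alpha (or the learning rate) halves
# the effective step size.
def lora_scaling(lora_r, lora_alpha):
    return lora_alpha / lora_r

print(lora_scaling(128, 256))  # 2.0
print(lora_scaling(128, 128))  # 1.0
```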
-
### Question
In the training scripts, `mm_vision_select_layer` is set to -2, which means the output of the penultimate layer of the CLIP vision encoder is used as the image features. I wonder why not use the last…
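For context, the selection is just negative indexing into the encoder's per-layer hidden states. A minimal sketch (plain Python; the list of layer outputs stands in for what, e.g., HF `CLIPVisionModel` returns with `output_hidden_states=True`):

```python
# hidden_states is ordered [embeddings, layer 1, ..., last layer],
# so index -2 picks the penultimate transformer layer and -1 the last.
def select_vision_features(hidden_states, mm_vision_select_layer=-2):
    return hidden_states[mm_vision_select_layer]

states = ["embeddings", "layer1", "layer2", "layer3"]
print(select_vision_features(states))      # penultimate -> layer2
print(select_vision_features(states, -1))  # last layer  -> layer3
```

A commonly cited reason for -2 is that CLIP's final layer is specialized toward its contrastive pooling objective, while the penultimate layer retains more general patch-level features; whether that holds here is for the authors to confirm.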
-
### System Info
NA
### Who can help?
@muellerz @sunma
### Information
- [ ] The official example scripts
- [X] My own modified scripts
### Tasks
- [ ] An officially supported task in the `examp…
-
What is the Qwen2-VL Max HF Demo config?
https://huggingface.co/spaces/Qwen/Qwen2-VL
In the demo from this repo, I found the setup for 7B, but is Qwen2-VL-Max the same?
Could someone please prov…
-
I fine-tuned LLaVA-OneVision from lmms-lab/llava-onevision-qwen2-7b-ov with `--lora_enable True --lora_r 128 --lora_alpha 256 --mm_projector_lr 2e-5` and have a checkpoint saved; how can I use thi…
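For what it's worth, the usual route is to load the base model, attach the LoRA checkpoint, and merge; in HF PEFT that is `PeftModel.from_pretrained(base, checkpoint_dir).merge_and_unload()`. The merge itself is just `W + (alpha / r) * (B @ A)`; a self-contained sketch of that arithmetic (toy shapes, pure Python, not the repo's loader):

```python
def matmul(X, Y):
    # naive matrix multiply for the toy example
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def merge_lora(W, A, B, lora_r, lora_alpha):
    # merged weight = W + (alpha / r) * (B @ A)
    scaling = lora_alpha / lora_r
    delta = matmul(B, A)
    return [[w + scaling * d for w, d in zip(wr, dr)]
            for wr, dr in zip(W, delta)]

# 2x2 base weight with a rank-1 adapter; the scaling 2/1 here matches
# the issue's 256/128 ratio.
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [0.0]]   # shape (out_dim, r)
A = [[0.5, 0.5]]     # shape (r, in_dim)
print(merge_lora(W, A, B, lora_r=1, lora_alpha=2))  # [[2.0, 1.0], [0.0, 1.0]]
```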
-
We encountered an error:
`[2024-09-23 11:13:54,886] [INFO] [launch.py:315:sigkill_handler] Killing subprocess 123969
[2024-09-23 11:13:54,887] [ERROR] [launch.py:321:sigkill_handler] `
with return co…
-
When I run
`bash scripts/video/demo/video_demo.sh ${the path of LLaVA-NeXT-Video-7B-DPO} vicuna_v1 32 2 True ${the path of video}`
I get the error:
```
Can't set vocab_size with value 32000 for …
```
-
step 1: `pretrain_projector_image_encoder.sh`
step 2: `pretrain_projector_video_encoder.sh`
step 3: `finetune_dual_encoder.sh`
step 4: `eval/vcgbench/inference/run_ddp_inference.sh`
step 5: `eval/vcgbench/gpt_e…`
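The stages above appear to be sequential (each script consumes the previous stage's output), so a driver script can chain them. A sketch, assuming repo-root-relative paths; the truncated step-5 script is omitted, and execution is commented out so the sketch only prints the order:

```shell
#!/usr/bin/env bash
set -e  # stop at the first failing stage

for stage in \
    pretrain_projector_image_encoder.sh \
    pretrain_projector_video_encoder.sh \
    finetune_dual_encoder.sh \
    eval/vcgbench/inference/run_ddp_inference.sh
do
    echo "stage: $stage"
    # bash "$stage"  # uncomment to actually run each stage
done
```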