LLaVA-VL / LLaVA-NeXT

Apache License 2.0
2.82k stars 226 forks source link

When will the training code be available? #93

Open LaBaZh opened 4 months ago

LaBaZh commented 4 months ago

as the title tells. Are there any specific plan for releasing the training code?

zihaolucky commented 4 months ago

I'm trying to start with https://github.com/haotian-liu/LLaVA/blob/main/llava/train/train.py , and try to figure out the template/conversation/multiimage processing.

LaBaZh commented 4 months ago

I'm trying to start with https://github.com/haotian-liu/LLaVA/blob/main/llava/train/train.py , and try to figure out the template/conversation/multiimage processing.

Sounds good! Maybe you can refer to open-llava-next repo for training code.

dragen1860 commented 4 months ago

check this issuse: https://github.com/LLaVA-VL/LLaVA-NeXT/issues/79#issuecomment-2212369132

I guess the finetunning code already integrated by transformers.

JinhuiYE commented 3 months ago

Hi, I am trying to use the LLaVA 1.5 training code to finetune LLaVA-Next. However, I encounter an issue where the training process gets blocked when using multiple GPUs, and there are no error messages to help diagnose the problem.

Here are some details: I can train successfully with multiple GPUs on LLaVA 1.5. I can train successfully with a single GPU on LLaVA-Next and obtain the expected results.

Any tips for resolving this issue with multi-GPU training on LLaVA-Next?

JinhuiYE commented 3 months ago

I've identified the issue. It stems from changing the batch size during training. This is our own problem. Anyway, it works by adapting the training code from llava1.5, even though there are some revision to do.

whycantfindaname commented 3 months ago

I've identified the issue. It stems from changing the batch size during training. This is our own problem. Anyway, it works by adapting the training code from llava1.5, even though there are some revision to do.

@JinhuiYE Hi there, I also want to train/finetune llava-next on my own dataset. Could you share the training code or some useful links?

LaBaZh commented 3 months ago

I've identified the issue. It stems from changing the batch size during training. This is our own problem. Anyway, it works by adapting the training code from llava1.5, even though there are some revision to do.

@JinhuiYE Hi there, I also want to train/finetune llava-next on my own dataset. Could you share the training code or some useful links?

I reproduced a version of training code for llava-1.6 which enables video data training based on the open-llava-next repo, feel free to check.

TonyJiang17 commented 3 months ago

@LaBaZh I would love to see that as well. Where can I see that? Thanks in advance.

LaBaZh commented 3 months ago

@LaBaZh I would love to see that as well. Where can I see that? Thanks in advance.

Check My repo open-longva