When finetuning the pretrained Open-LLaVA-NeXT on the mixture data, I encountered an inverse loss spike issue (the loss periodically drops sharply and then recovers). Is this caused by the mixed composition of the data? Is it okay to see such a curve during finetuning?
This is quite normal and also occurs in the original LLaVA-1.5. It is likely due to the pure-text content in the mixture, although I have not validated this.
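If you want to check that hypothesis, one cheap diagnostic is to bucket the per-step loss by whether the batch contains images; if the downward spikes line up with the text-only buckets, the data mixture is the cause. Below is a minimal sketch, not code from the Open-LLaVA-NeXT repo: the `images` key and the `record_step` helper are assumptions, so adapt the check to however your dataloader marks pure-text samples.

```python
from collections import defaultdict

import torch

# Hypothetical helper: bucket per-step losses by modality so the loss
# curve can be split into text-only vs. image-text contributions.
# The 'images' key is an assumption about the batch dict; adjust it to
# match your dataloader.
loss_by_modality = defaultdict(list)

def record_step(batch: dict, loss: torch.Tensor) -> None:
    # Treat a batch with a missing or None 'images' entry as pure text.
    key = "text_only" if batch.get("images") is None else "multimodal"
    loss_by_modality[key].append(loss.detach().item())

# Usage inside the training loop (sketch):
#   loss = model(**batch).loss
#   record_step(batch, loss)
# Afterwards, compare the mean loss per bucket:
#   print({k: sum(v) / len(v) for k, v in loss_by_modality.items()})
```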