Please correct me if I am wrong. According to this comment, currently, only vllava dataset is available, while the reported performance is trained on another dataset.
It seems there is huge performance gap between the same model training on two different datasets (according to table 1 and table 5).
Considering training video llama2 is somehow expensive, could you please provide the performance of video llama2 on each benchmark(a part from those already listed in table 1) ?
Hi,
Please correct me if I am wrong. According to this comment, currently, only vllava dataset is available, while the reported performance is trained on another dataset.
It seems there is huge performance gap between the same model training on two different datasets (according to table 1 and table 5).
Considering training video llama2 is somehow expensive, could you please provide the performance of video llama2 on each benchmark(a part from those already listed in table 1) ?
Best