-
When I run
`bash scripts/video/demo/video_demo.sh ${the path of LLaVA-NeXT-Video-7B-DPO} vicuna_v1 32 2 True ${the path of video}`
I get the error
```
Can't set vocab_size with value 32000 for …
```
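A quick way to check whether this comes from a mismatch between the checkpoint's `vocab_size` and the tokenizer is to compare the two directly. This is only a sketch: the model path is a placeholder for your local download, and `config.json` is read as plain JSON in case your transformers version does not recognise the custom model type.

```python
import json
from transformers import AutoTokenizer

# Placeholder path to the downloaded checkpoint; substitute your own.
model_path = "/path/to/LLaVA-NeXT-Video-7B-DPO"

# Read vocab_size straight from config.json so this works even if
# transformers does not recognise the repo's custom model_type.
with open(f"{model_path}/config.json") as f:
    config = json.load(f)

tokenizer = AutoTokenizer.from_pretrained(model_path, use_fast=False)

# If these two numbers disagree (e.g. 32000 vs. a larger tokenizer after
# special tokens were added), that mismatch is the usual trigger for
# errors like the one above.
print("config.json vocab_size:", config.get("vocab_size"))
print("len(tokenizer):        ", len(tokenizer))
```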
-
### The model to consider.
The llava-next-video project has already been released, and the test results are quite good. Are there any plans to support this project?
`https://github.com/LLaVA-VL/LLaV…
-
Hi Team,
I saw that LLaVA-NeXT-Video-32B-Qwen obtains 77.31% and 63% accuracy on NExT-QA and EgoSchema, respectively, here: https://huggingface.co/lmms-lab/LLaVA-NeXT-Video-32B-Qwen.
On the other hand, LLaVA-NeXT…
-
Great work! I notice the LLaVA-NeXT-Qwen2 (image model) can achieve a surprising Video-MME result of 49.5. In contrast, LLaVA-NeXT-Video (Llama3) only achieves a 30+ Video-MME score (according to…
-
I cloned the "lmms-lab/LLaVA-NeXT-Interleave-Bench" dataset and the "llava-onevision-qwen2-7b-ov" checkpoint from Hugging Face to reproduce the results of the paper, but some benchmark results seem to be v…
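For reference, a minimal sketch of pulling both repos locally with `huggingface_hub` before running the evaluation; the checkpoint repo ID is assumed to live under `lmms-lab`, and the local directories are placeholders.

```python
from huggingface_hub import snapshot_download

# Download the benchmark data (a dataset repo) and the checkpoint (a model repo).
# local_dir values are placeholders; adjust to your setup.
snapshot_download(
    repo_id="lmms-lab/LLaVA-NeXT-Interleave-Bench",
    repo_type="dataset",
    local_dir="data/LLaVA-NeXT-Interleave-Bench",
)
snapshot_download(
    repo_id="lmms-lab/llava-onevision-qwen2-7b-ov",
    repo_type="model",
    local_dir="checkpoints/llava-onevision-qwen2-7b-ov",
)
```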
-
I tested the batch inference results of the llava and llava-next-video models using tensorrt-llm based on the examples/multimodal/run.py file. The parameters for their generate method are the same, as…
-
Hi, thanks for your great work. I was wondering how many GPUs are needed to train LLaVA-NeXT with a 72B LLM.
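Not an official answer, but a rough back-of-envelope sketch of the memory footprint for fully fine-tuning a 72B LLM with Adam in bf16 mixed precision (ignoring activations, the vision tower, and framework overhead) suggests the model and optimizer states alone already require many 80GB GPUs even when fully sharded.

```python
# Rough back-of-envelope estimate, illustrative only; not the authors' recipe.
params = 72e9

bytes_per_param = (
    2    # bf16 weights
    + 2  # bf16 gradients
    + 4  # fp32 master weights
    + 4  # Adam first moment (fp32)
    + 4  # Adam second moment (fp32)
)

total_gb = params * bytes_per_param / 1e9
gpu_memory_gb = 80  # e.g. A100/H100 80GB

print(f"model + optimizer states: ~{total_gb:.0f} GB")
print(f"minimum 80GB GPUs for these states alone (fully sharded): "
      f"{total_gb / gpu_memory_gb:.0f}+")
```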
-
Hi, this is really nice work that shows the potential of embedding anything using LLMs.
In Section 3.1, you explain that, via a summary prompt, both vision and text can be embedded into the next token. A…
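For readers following along, here is a minimal text-only sketch of the "summary prompt → next-token embedding" idea using a plain causal LM from transformers; the model name and prompt wording are placeholders rather than the paper's exact setup.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder model; the paper's setup uses a multimodal LMM instead.
model_name = "Qwen/Qwen2-7B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
model.eval()

text = "A dog playing in the snow."
# Hypothetical summary prompt: ask the model to compress the input into one token.
prompt = f"{text}\nSummarize the above in one word:"

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs, output_hidden_states=True)

# Last layer, last position: the hidden state that would predict the "summary"
# token is taken as the embedding of the input.
embedding = outputs.hidden_states[-1][:, -1, :]
print(embedding.shape)  # (1, hidden_size)
```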
-
Hello,
I am very interested in your work and was wondering when you might be providing the training scripts and pre-trained models.
Thank you!
-
## Title: LLaVA-OneVision: Easy Visual Task Transfer
## Link: https://arxiv.org/abs/2408.03326
## Abstract:
We present LLaVA-OneVision, a family of open large multimodal models (LMMs) developed by consolidating our insights into data, models, and visual representations from the LLaVA-NeXT blog series. Our experimental results…