mlvu Search Results - Githubissues

EvolvingLMMs-Lab/lmms-eval #400

Evaluating LLaVA OneVision 72B - memory, speedup, multinode

Hello, I am trying to evaluate LLaVA OneVision 72B, but finding I need to use tensor-parallelism to fit it on memory. However, when I do, evaluating on datasets (e.g., MLVU) takes 90+hrs on 4 A100s…

orrzohar updated 4 days ago

JUNJIE99/MLVU #7

How to submit results file to your MLVU online evaluation s…

When I open http://analysis.a1.luyouxia.net:23226/, it tells: 使用 TCP 映射用于 HTTP 协议访问时，请使用分配的域名加端口进行访问，不支持使用其它域名访问。（无效主机头: analysis.a1.luyouxia.net）

Leon1207 updated 3 weeks ago

open-compass/VLMEvalKit #589

MMBench-Video Dataset Download Bug

### Hello, I have an issue with executing the evaluation with MMBench-Video dataset: I try to execute `run.py` with _MMBench-Video_ dataset, but there are two scenarios that happen everytime I try to…

Noctis-SC updated 17 hours ago

InternLM/InternLM-XComposer #385

Two problem of evaluation

Hi, I have encountered the same problem. I have two questions about the evaluation: (1) How many video frames are used when assessing performance using the 5 benchmarks, including MVBench, MLVU, MM…

phac123 updated 3 months ago

fulfulggg/Information-gathering #554

LongVU: 長時間ビデオ言語理解のための時空間適応圧縮

## タイトル: LongVU: 長時間ビデオ言語理解のための時空間適応圧縮 ## リンク: https://arxiv.org/abs/2410.17434 ## 概要: マルチモーダル大規模言語モデル（MLLM）は、ビデオコンテンツの理解と分析において著しい進歩を遂げてきました。しかし、長いビデオの処理は、LLMのコンテキストサイズによって制限される大きな課題として残っています。この制…

fulfulggg updated 2 weeks ago

JUNJIE99/MLVU #4

Issues for MLVU dataset

Hello, thanks for your excellent work and I have some questions here: 1. What's the difference between MLVU dataset and MLVU_Test dataset? 2. I clone the project from GitHub to the local host, whet…

jchsun1 updated 1 month ago

JUNJIE99/MLVU #5

Leaderboard mismatch

Hi, Why are the results in the mini-leaderboard and full leaderboard as well as paper table different for the same methods?

ssantos97 updated 1 month ago

JUNJIE99/MLVU #2

Where can I download the videos

jdsannchao updated 2 months ago

FlagOpen/FlagEmbedding #976

MLVU: some answers are not in the candidates

Thanks for your work in MLVU. I found some errors in the benchmark, i.e., some answers are not in the candidates:

Richar-Du updated 3 months ago

Oryx-mllm/Oryx #14

Evaluation on MVBench

Hi authors, thanks for your great work. I try to evaluate Oryx-7b on MVBench and use the script: export HF_HOME="/mnt/data/datasets/hub" export GPT_EVAL_VERSION="gpt-3.5-turbo" export OPENAI_A…

yxsysu updated 3 weeks ago

18 results for mlvu

18 results
for mlvu