how to launch local inference

Ahnsun / merlin

[ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds


Sorry, due to time constraints we have not yet organized the code to support multi-round, multi-frame video demos. Single-round dialogue is supported at this stage, and you can run it with the command below. We also provide some example cases that you can follow to chat.

    CUDA_VISIBLE_DEVICES=0 torchrun --master_port=23425 mmgpt/engine/eval/eval_box.py \
        --model_name_or_path /path/to/merlin-weights \
        --vision_tower /path/to/clip-vit-large-patch14-448 \
        --image_size 448 \
        --model_max_length 4096 \
        --image_aspect_ratio resize \
        --projector conv \
        --conv_stride 2 \
        --bf16 True \
        --output_dir ./output
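If you would rather launch this from a script than retype the flags, the following is a minimal sketch that assembles the same command in Python. The helper names `build_eval_box_command` and `build_env` are hypothetical and not part of the Merlin repository; the flags and values are copied from the reply above, and the two `/path/to/...` arguments remain placeholders you must fill in yourself.

```python
import os
import shlex


def build_eval_box_command(model_path, vision_tower_path,
                           output_dir="./output", master_port=23425):
    """Return the torchrun argument list from the reply above.

    Hypothetical helper: it only assembles arguments, it does not
    check that the paths exist or that the flags are still current.
    """
    return [
        "torchrun",
        f"--master_port={master_port}",
        "mmgpt/engine/eval/eval_box.py",
        "--model_name_or_path", model_path,
        "--vision_tower", vision_tower_path,
        "--image_size", "448",
        "--model_max_length", "4096",
        "--image_aspect_ratio", "resize",
        "--projector", "conv",
        "--conv_stride", "2",
        "--bf16", "True",
        "--output_dir", output_dir,
    ]


def build_env(gpu_id=0):
    """Copy the current environment and pin the run to one GPU."""
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    return env


if __name__ == "__main__":
    cmd = build_eval_box_command(
        "/path/to/merlin-weights",              # placeholder, as in the reply
        "/path/to/clip-vit-large-patch14-448",  # placeholder, as in the reply
    )
    # Print the command for inspection instead of running it.
    print(shlex.join(cmd))
    # To actually launch, run from the repository root, e.g.:
    #   import subprocess
    #   subprocess.run(cmd, env=build_env(0), check=True)
```

Printing the command first makes it easy to confirm the paths before starting a GPU job; the launch itself is left commented out because it assumes the working directory is the repository root.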

how to launch local inference #3