-
Apologies for the questions about another of your significant works.
I really appreciate your work on AffordanceLLM: Grounding Affordance from Vision Language Models, and this 3DOI about the breaking contrib…
-
### Have I written custom code (as opposed to using a stock example script provided in MediaPipe)
None
### OS Platform and Distribution
Windows 10
### MediaPipe Tasks SDK version
_No response_
###…
-
## ⚙️ Request New Models
- Link to an existing implementation (e.g. Hugging Face/Github): https://huggingface.co/microsoft/Phi-3-vision-128k-instruct
- Is this model architecture supported by MLC…
-
**Details of model being requested**
- Model name: Florence-2
- Source repo link: https://huggingface.co/collections/microsoft/florence-6669f44df0d87d9c3bfb76de
- Research paper link: https://arxiv…
-
An Introduction to Vision-Language Modeling
https://arxiv.org/abs/2405.17247
-
-
The performance of your work is very impressive!
In your paper, you state that UVLTrack was trained on GOT-10k, COCO2017, TrackingNet, and other datasets.
But other vision-language trackers like JointNLT …
-
LoRA + base is working well.
![image](https://github.com/mbzuai-oryx/LLaVA-pp/assets/15274284/ccec0900-7db0-4729-9ab4-3c5f68e0f304)
![image](https://github.com/mbzuai-oryx/LLaVA-pp/assets/15274284/7d12…
-
### Describe the documentation issue
All of the samples appear to be in Python; nothing in C or C#.
Python is great for academia, but most Windows desktop app developers are developing desktop…
-
Hi! Thank you again for this repo. Fine-tuning with Llama 3 works. However, when I try to merge with the obtained LoRA weights, using the `merge_lora_weights.py` script, and I compare the weights b…
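For context on what such a weight comparison should show: in standard LoRA, merging folds the low-rank update into the base weight as W' = W + (alpha / r) * B @ A, so the merged weights should differ from the base weights by exactly that scaled product (up to dtype rounding). A minimal pure-Python sketch of the arithmetic, with toy dimensions and values that are illustrative only, not taken from the repo or any real checkpoint:

```python
# Toy LoRA merge: W' = W + (alpha / r) * B @ A
# All matrices are nested lists with illustrative values.

def matmul(B, A):
    """Multiply a (d x r) matrix by an (r x k) matrix."""
    r, k = len(A), len(A[0])
    return [[sum(B[i][t] * A[t][j] for t in range(r)) for j in range(k)]
            for i in range(len(B))]

def merge_lora(W, A, B, alpha, r):
    """Fold the scaled LoRA update into the base weight matrix."""
    scaling = alpha / r
    delta = matmul(B, A)  # d x k low-rank update
    return [[W[i][j] + scaling * delta[i][j] for j in range(len(W[0]))]
            for i in range(len(W))]

# 2x2 base weight with a rank-1 LoRA factorization.
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[1.0, 2.0]]         # r x k = 1 x 2
B = [[0.5], [0.25]]      # d x r = 2 x 1
merged = merge_lora(W, A, B, alpha=2.0, r=1)
print(merged)  # [[2.0, 2.0], [0.5, 2.0]]
```

If the merged checkpoint's weights do not equal base + scaled delta (beyond small dtype-conversion differences), that would point at the merge step rather than the fine-tuning itself.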