vlms Search Results - Githubissues

332 results
for vlms

yhytoto12/new-arxiv-papers #1

New papers for 2022-07-26 Tue!

# 💻 cs ## 📚 mask (total: 9) ### 📃 Deep Pneumonia: Attention-Based Contrastive Learning for Class-Imbalanced Pneumonia Lesion Recognition in Chest X-rays - **Authors:** Xinxu Wei, Haohan Bai, Xianshi …

yhytoto12 updated 2 years ago
1
turboderp/exllamav2 #399

Input the embedding tensor into LLMs?

If I want to work with multimodal LLMs that take in a set of embeddings from vision/audio encoders, what is the proper way of inputting them into an LLM running on exllamav2? Can I just add a custo…

aliencaocao updated 2 weeks ago
41
yunqing-me/AttackVLM #7

Questions about attack on BLIP (LAVIS)

Thank you for releasing the code and providing an in-depth analysis in the paper. I have the following two questions when reproducing the attack code on the model `blip2` in `LAVIS_tool`. 1.…

ericyinyzy updated 8 months ago
3
open-compass/VLMEvalKit #150

IndexError: index 1 is out of bounds for dimension 0 with si…

cur_image_features = image_features[cur_image_idx] ~~~~~~~~~~~~~~^^^^^^^^^^^^^^^ IndexError: index 1 is out of bounds for dimension 0 with size 1 Is there any reason …

lucasjinreal updated 6 months ago
8
tianyi-lab/HallusionBench #8

[Results] Sharing HallusionBench results evaluated by VLMEva…

Dear authors, First, congratulations on your great work, which we think is a valuable resource for evaluating hallucination in VLMs. We have implemented HallusionBench in [VLMEvalKit](https://githu…

kennymckormick updated 9 months ago
1
OpenBMB/MiniCPM-V #212

LoRA fine-tuning: grad_norm is NaN, loss is 0 [BUG] <title>

### Is there an existing issue / discussion for this? - [X] I have searched the existing issues / discussions ### Is this question answered in the FAQ? | Is there an existing…

tayton42 updated 1 month ago
6
linzhiqiu/t2v_metrics #7

Adapting VQAScore to a new VLM

Hey there, I am interested in running VQAScore with another VLM, CogVLM (see [here](https://huggingface.co/THUDM/cogvlm-chat-hf)). I was looking at the guidelines on how to adapt to another VQA model …

alzaia updated 3 months ago
10
vllm-project/vllm #8400

[Bug]: Pixtral leads to Expected at least 18286 dummy tokens…

### Your current environment H100 40GB ### Model Input Dumps _No response_ ### 🐛 Describe the bug ``` docker run -d --restart=always \ --runtime=nvidia \ --gpus '"device=MIG-2ea01c20-8…

pseudotensor updated 1 month ago
22
ndif-team/nnsight #85

Requesting support for input_embeds for tracer.invoke()

I'm using the `LanguageModel` class to wrap a vision-language model LLaVA, and during the execution of ```python with tracer.invoke(inputs) ``` [`nnsight/contexts/Invoker.py#L55`](https://github.…

HuFY-dev updated 7 months ago
4
NVlabs/RADIO #60

Use RADIOV2 as VLM's vision encoder.

Hello, thank you for your great work! We are currently exploring the utilization of radio as a vision encoder for vision language models. In our specific setup, we employ [SigClip](https://huggingfac…

echo840 updated 2 months ago
16
