large-vision-language-models Search Results

1000+ results
for large-vision-language-models

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

sangminwoo/awesome-vision-and-language #13

Add a CVPR 2024 paper

Could you add our CVPR 2024 paper about vision-language pertaining, "Iterated Learning Improves Compositionality in Large Vision-Language Models", into this repo? Paper link: https://arxiv.org/abs/…

hellomuffin updated 1 day ago
1
xp1632/Aalen_working_log #3

`ViperGPT` and its related paper : compositional visual infe…

- While the Latex env is not fully set, we'll write our thoughts here for now ---- - `ViperGPT` is a framework that leverages the pre-trained vision language models (`GLIP` for image object ground…

xp1632 updated 1 day ago
2
KejiaZhang-Robust/Adversarial-Robustness-Papers #1

NeurIPS 2023相关论文

同学你好，非常感谢你对这一系列论文的整理和梳理，真的帮助很大！在阅读文献时注意到，仓库中部分标注为“2024-NeurIPS”的论文是“2023-NeurIPS”。以下是我发现的相关论文列表，供参考： 2023-NeurIPS：[Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularizatio…

lightrain-a updated 1 day ago
3
VectorSpaceLab/Video-XL #13

Include discussions about VoCo-LLaMA in your paper.

Dear authors, @shuyansy @UnableToUseGit I kindly think you need to discuss VoCo-LLaMA[1] in the "Intro" section of your paper at the very least. As I find the citation and discussions related to …

Yxxxb updated 1 week ago
2
shikiw/Awesome-MLLM-Hallucination #2

Add Paper Request

Dear shikiw, Thank you for your valuable effort in curating research on MLLM hallucination! This excellent repository is impressively comprehensive and provides researchers with a clear sense of th…

Ruiyang-061X updated 5 hours ago
1
amir9979/reading_list #7077

Dave Van Veen - new related research

*Sent by Google Scholar Alerts (scholaralerts-noreply@google.com). Created by [fire](https://fire.fundersclub.com/).* --- ### ### ### [PDF] [Attention Prompting on Image for Large Vision-Language…

fire-bot updated 1 month ago
1
BradyFU/Awesome-Multimodal-Large-Language-Models #188

Inquiry for adding new paper

Hi, Thanks for your efforts on such a valuable collection! Could you please add the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate"? M…

shikiw updated 2 weeks ago
1
xp1632/DFKI_working_log #75

`MLMM: Multi Modal Large Language Model`

- Here's the summary of consulting a LLM specialist: --- - We have an initial thought in #74 as follows: ![image](https://github.com/user-attachments/assets/265a3d7d-0454-4e7b-9c99-a0dd9f9ecf7c…

xp1632 updated 5 days ago
2
InternLM/lmdeploy #2287

[Feature] Support for decoding method that reduce Hallucinat…

### Motivation Recently，there are many good paper that try to alleviating hallucinations for large vision-language models **during the decode process**，like： OPERA: Alleviating Hallucination in Mu…

zhly0 updated 2 months ago
1
ManifoldRG/MultiNet #196

MultiNet v0.1 Release Tracker

This is a master issue to track all items related to the November 1st MultiNet Release. The motivation & scoping for this release is below. We follow w/ the specific issues being tracked with specific…

harshsikka updated 3 weeks ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for large-vision-language-models

1000+ results
for large-vision-language-models