vision-language-model Search Results

1000+ results
for vision-language-model


vllm-project/vllm #416

[Feature Request] Support input embedding in `LLM.generate()…

Hi, I am using llm as part of a multimodal model, so the model needs to pass `input embedding tensor` directly to generate, and also needs to access the language model's `embed_tokens` member to first c…

KimmiShi updated 2 days ago
15
bridge-23/Agentic-Web #1

🚀 Agentic Web Platform - Product Requirements Document

## Executive Summary The **Agentic Web Platform** is an advanced AI-driven dashboard that empowers users to customize, manage, and optimize their AI agents with unparalleled precision and control.…

wirapratamaz updated 1 day ago
1
sgl-project/sglang #1487

Development Roadmap (2024 Q4)

Here is the development roadmap for 2024 Q4. Contributions and feedback are welcome ([**Join Bi-weekly Development Meeting**](https://t.co/4BFjCLnVHq)). Previous 2024 Q3 roadmap can be found in #634. …

Ying1123 updated 1 day ago
11
icereed/paperless-gpt #43

Document loading takes excessively long and no documents are…

There is a significant performance issue when loading documents: the process takes an unusually long time, and afterward, no documents are found. To test the system, I tagged several documents with…

BlackJoker90 updated 5 days ago
10
irthomasthomas/undecidability #892

Vespa 🤝 ColPali: Efficient Document Retrieval with Vision La…

- [ ] [Vespa 🤝 ColPali: Efficient Document Retrieval with Vision Language Models — pyvespa documentation](https://pyvespa.readthedocs.io/en/latest/examples/colpali-document-retrieval-vision-language-m…

ShellLM updated 3 months ago
1
BradyFU/Awesome-Multimodal-Large-Language-Models #188

Inquiry for adding new paper

Hi, Thanks for your efforts on such a valuable collection! Could you please add the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate"? M…

shikiw updated 2 weeks ago
1
containers/ramalama #150

Vision models

## Value Statement As someone who wants a boring way to use AI I would like to expose an image/PDF/document to the LLM So that I can make requests and extract information, all within Ramalama …

p5 updated 1 month ago
10
inikishev/torchzero-old #3

SPSA convergence

Hi inikishev, I use SPSA random noise (RDSA) to add a little noise to the loss function to perturb an image into an adversarial image for vision-language models. Here I use the InstructBLIP model. However, it does …

guanhdrmq updated 2 weeks ago
18
InternLM/lmdeploy #1514

[Bug] Issues Running Vision Language Models in Docker

### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [X] 2. The bug has not been fixed in the latest version. ### Describe the bug Hi folks, thanks for t…

ghost updated 5 months ago
8
huggingface/optimum-benchmark #295

Vision language model support

Hello! 💗 When trying to run benchmarks on vision language models (image-text-to-text) I realized this library doesn't support this task. It would be nice to have support for it since these models ar…

merveenoyan updated 2 days ago
1
