-
# URL
- https://arxiv.org/abs/2411.02571
# Authors
- Sheng-Chieh Lin
- Chankyu Lee
- Mohammad Shoeybi
- Jimmy Lin
- Bryan Catanzaro
- Wei Ping
# Abstract
- State-of-the-art retrieval mod…
-
**New Feature**:
The final LLM that answers the query should take as input the images extracted from the files (see the sketch after the specification below).
**Specification**:
- Multimodal LLMs supported
- Easy to extend for new architec…
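A minimal sketch of what this spec could look like in practice, assuming an OpenAI-compatible vision endpoint; the model name `gpt-4o`, the helper `answer_with_images`, and the file paths are illustrative, not part of the request. The answering call receives the query text plus the images extracted from the files:

```python
import base64
from openai import OpenAI

client = OpenAI()

def answer_with_images(query: str, image_paths: list[str]) -> str:
    # Build one user message containing the query text and every extracted image.
    content = [{"type": "text", "text": query}]
    for path in image_paths:
        with open(path, "rb") as f:
            b64 = base64.b64encode(f.read()).decode()
        content.append({
            "type": "image_url",
            "image_url": {"url": f"data:image/png;base64,{b64}"},
        })
    resp = client.chat.completions.create(
        model="gpt-4o",  # assumption: any vision-capable chat model would do
        messages=[{"role": "user", "content": content}],
    )
    return resp.choices[0].message.content

print(answer_with_images("What does the chart on page 3 show?", ["page3_fig1.png"]))
```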
-
Do you have any plans to support multimodal LLMs, such as MiniGPT-4/MiniGPT v2 (https://github.com/Vision-CAIR/MiniGPT-4/) and LLaVA (https://github.com/haotian-liu/LLaVA/)? That would be a significan…
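For reference, LLaVA 1.5 can already be driven directly through Hugging Face transformers, independent of any project-specific abstraction; a minimal sketch (the checkpoint and image path are illustrative):

```python
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Single image-grounded question in LLaVA's chat format.
image = Image.open("example.jpg")
prompt = "USER: <image>\nWhat is shown in this image? ASSISTANT:"
inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(processor.decode(output[0], skip_special_tokens=True))
```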
-
### Feature request
Is it possible to run multimodal LLMs like Qwen-VL or LLaVA 1.5 using openllm?
### Motivation
_No response_
### Other
_No response_
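Whether openllm can serve these depends on its backend support; as a fallback, Qwen-VL-Chat can be run directly through transformers using the chat interface shipped with the checkpoint (a hedged sketch, not an openllm API; `trust_remote_code` loads the model's own code, and the image URL is illustrative):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen-VL-Chat", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen-VL-Chat", device_map="auto", trust_remote_code=True
).eval()

# Qwen-VL-Chat interleaves images and text via from_list_format.
query = tokenizer.from_list_format([
    {"image": "https://example.com/demo.jpg"},
    {"text": "Describe this image."},
])
response, history = model.chat(tokenizer, query=query, history=None)
print(response)
```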
-
### Title
Dreamcatcher: decoding dream events in EEG data with multimodal language models and interpretability tools
### Leaders
Lorenzo Bertolini
### Collaborators
_No response_
###…
-
### Feature Description
AWS Bedrock has a few multimodal LLMs, such as Claude Opus. It would be great if this could be added as a multi-modal-llm integration. There is already an anthropic multimodal …
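For context, the raw Bedrock call such an integration would wrap looks roughly like this with boto3, using Anthropic's documented messages format with a base64 image block (a sketch only; the region, image file, and model ID `anthropic.claude-3-opus-20240229-v1:0` are assumptions for illustration):

```python
import base64
import json
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

with open("chart.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

body = {
    "anthropic_version": "bedrock-2023-05-31",
    "max_tokens": 512,
    "messages": [{
        "role": "user",
        "content": [
            # Image first, then the question about it.
            {"type": "image", "source": {"type": "base64",
                                         "media_type": "image/png",
                                         "data": image_b64}},
            {"type": "text", "text": "Describe this image."},
        ],
    }],
}
response = client.invoke_model(
    modelId="anthropic.claude-3-opus-20240229-v1:0",
    body=json.dumps(body),
)
print(json.loads(response["body"].read())["content"][0]["text"])
```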
-
### Description
When using memory=True for a crew that uses Azure OpenAI, there is an error creating long-term memory.
### Steps to Reproduce
```
import os
from chromadb.utils.embedding_…
```
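A hedged sketch of the likely repro setup: chromadb's `OpenAIEmbeddingFunction` configured for Azure OpenAI. The endpoint, deployment name, and API version are placeholders, and whether CrewAI's long-term memory actually picks this function up depends on its embedder configuration:

```python
import os
from chromadb.utils import embedding_functions

# Azure OpenAI differs from vanilla OpenAI in api_type, api_base,
# api_version, and the deployment name.
azure_ef = embedding_functions.OpenAIEmbeddingFunction(
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_base="https://<your-resource>.openai.azure.com/",
    api_type="azure",
    api_version="2023-05-15",
    deployment_id="text-embedding-ada-002",  # your Azure deployment name
    model_name="text-embedding-ada-002",
)
print(len(azure_ef(["hello world"])[0]))  # sanity check: one embedding vector
```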
-
Is it possible to merge multimodal LLMs?
For example, could LLaVA and CodeLlama be merged? It might be beneficial for some software engineering tasks.
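A hedged sketch of the naive version of this idea: linearly interpolating weights between two Llama-family checkpoints. This only makes sense where tensor names and shapes line up, e.g. LLaVA 1.5's language tower (Vicuna-7b-v1.5) against CodeLlama-7b; LLaVA's vision tower and projector have no counterpart and are left untouched. The model IDs and the 0.5 mixing ratio are illustrative, and whether the merged model is actually useful is an open question:

```python
import torch
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("codellama/CodeLlama-7b-hf")
other = AutoModelForCausalLM.from_pretrained("lmsys/vicuna-7b-v1.5")

alpha = 0.5  # interpolation weight toward the base model
other_state = other.state_dict()
merged = base.state_dict()
for name, tensor in merged.items():
    # Only blend parameters that exist in both models with matching shapes.
    if name in other_state and other_state[name].shape == tensor.shape:
        merged[name] = alpha * tensor + (1 - alpha) * other_state[name]

base.load_state_dict(merged)
base.save_pretrained("merged-model")
```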
-
### 🚀 The feature, motivation and pitch
Let's build an example demo app, perhaps in pytorch-labs, which will become a forcing function to improve developer experience from a user perspective. A pos…
-
### 🚀 The feature, motivation and pitch
***Please note that since the actual implementation is going to be simple, and the design has already been reviewed, the purpose of this GitHub Issue is to l…