-
# URL
- https://arxiv.org/abs/2411.02571
# Authors
- Sheng-Chieh Lin
- Chankyu Lee
- Mohammad Shoeybi
- Jimmy Lin
- Bryan Catanzaro
- Wei Ping
# Abstract
- State-of-the-art retrieval mod…
-
### Your current environment
```text
The output of `python collect_env.py`
```
### How would you like to use vllm
I want to run inference of [ColPali](https://huggingface.co/vidore/colpali). I …
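Independent of the serving engine, the scoring step ColPali uses can be sketched in a few lines. ColPali follows the ColBERT-style "late interaction" (MaxSim) scheme: each query-token embedding is compared against every image-patch embedding of a page, the best match per token is kept, and those maxima are summed. The tiny 3-d vectors below are placeholders for real model output, not actual ColPali embeddings:

```python
# Minimal sketch of ColBERT/ColPali-style late-interaction (MaxSim) scoring.
# The toy 3-d vectors stand in for real query-token and page-patch embeddings.

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def maxsim_score(query_embs, page_embs):
    # query_embs: one vector per query token; page_embs: one per image patch.
    # Assumes all vectors are L2-normalized, so the dot product is the cosine
    # similarity. Each query token keeps only its best-matching patch.
    return sum(max(dot(q, p) for p in page_embs) for q in query_embs)

# Two query tokens vs. two page patches: the first token matches patch 0
# perfectly, the second matches nothing, so the score is 1.0 + 0.0.
query = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
page = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]
print(maxsim_score(query, page))  # → 1.0
```

Ranking pages then just means computing this score per candidate page and sorting; the expensive part in practice is producing the patch embeddings, which is what an engine like vLLM would accelerate.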
-
**Is your feature request related to a problem? Please describe.**
- Obsidian in many use cases contains a lot of non-text, unstructured data, such as
1. images
2. PDFs
3. PDFs with c…
-
I am currently planning to prepend an image to the query section, meaning the query will consist of an image along with a question about it. The system will then search the provided documents to find …
-
### Proposal summary
## Feature Request
Enable Opik to display additional media formats, including audio, PDF, and video.
## Background
Opik currently supports only image display, which li…
-
A notebook that demonstrates how to use a multimodal RAG that combines two types of inputs, such as text and images, to retrieve relevant information from a dataset and generate new outputs based on t…
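The retrieval half of such a multimodal RAG flow can be sketched with a toy encoder. The bag-of-words `embed` below is only a stand-in for a real multimodal model (e.g. CLIP), which would map both text and images into one shared vector space; the document schema (`text` plus `image_tags` fields) is likewise an assumption for illustration:

```python
import math
from collections import Counter

# Toy stand-in for a real multimodal encoder such as CLIP: embeds a
# "document" (a dict with optional "text" and "image_tags" fields) into a
# shared bag-of-words vector. In a real pipeline one pretrained model would
# encode both modalities into the same dense vector space.
def embed(doc):
    tokens = doc.get("text", "").lower().split() + doc.get("image_tags", [])
    return Counter(tokens)

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, corpus, k=1):
    # Embed the (possibly image+text) query once, then rank documents by
    # similarity in the shared space -- the core of multimodal retrieval.
    q = embed(query)
    return sorted(corpus, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

corpus = [
    {"text": "invoice total and payment terms", "image_tags": ["table", "invoice"]},
    {"text": "vacation photos from the beach", "image_tags": ["beach", "sunset"]},
]
query = {"text": "what is the invoice total", "image_tags": ["invoice"]}
print(retrieve(query, corpus)[0]["text"])  # → invoice total and payment terms
```

The generation half would then pass the retrieved documents, along with the original query, to a multimodal LLM to produce the final answer.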
-
**Is your feature request related to a problem? Please describe.**
I'm frustrated when I can't use multimodal models like "gpt-4-vision-preview" in Cheshire-cat-ai to process and retrieve information…
-
Submitting Author: Tharsis Souza (@souzatharsis)
Package Name: podcastfy
One-Line Description of Package: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with Gen…
-
Are there any versions of the model **Visualized BGE based on BAAI/bge-base-zh-v1.5**? And how does BAAI/bge-visualized-m3 perform compared with ChineseCLIP?
-
See https://www.llamaindex.ai/blog/multimodal-rag-in-llamacloud