multimodal-retrieval Search Results

291 results for multimodal-retrieval


amir9979/reading_list #7077

Dave Van Veen - new related research

*Sent by Google Scholar Alerts (scholaralerts-noreply@google.com). Created by [fire](https://fire.fundersclub.com/).* [PDF] [Attention Prompting on Image for Large Vision-Language…

fire-bot updated 1 month ago
1
bigshanedogg/survey #23

An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA

## Problem statement 1. Performance bottleneck in knowledge-based VQA due to a two-phase architecture consisting of knowledge retrieval from external sources and training a question answering task in super…

bigshanedogg updated 2 years ago
1
OpenGVLab/unmasked_teacher #49

unable to reproduce zero-shot results

Hey - I am unable to reproduce the reported zero-shot results. So far I tried it on MSRVTT and MSVD, I would appreciate it if you kindly have a look. Here is what I got after running these 2 script…

pritamqu updated 2 months ago
8
CsabaConsulting/InspectorGadgetApp #48

RAG: Upgrade text embedding from text-embedding-004 to text-…

CHANDRA got me thinking about the new `text-embedding-preview-0815` model to upgrade from `text-embedding-004`. However https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/off…

MrCsabaToth updated 1 month ago
16
GoogleCloudPlatform/generative-ai #457

[Bug]: Vector Search Index creation failed due to InternalSe…

### Contact Details _No response_ ### File Name `gemini/use-cases/retrieval-augmented-generation/multimodal_rag_langchain.ipynb` ### What happened? The following step failed with error …

JasperW01 updated 1 month ago
6
LAION-AI/Open-Assistant #3007

Next Iteration Meeting (Friday, May 5, 2023 7:00pm UTC)

Topics for the next meeting

AbdBarho updated 1 year ago
14
amusi/CVPR2024-Papers-with-Code #210

Welcome to share the paper and code of…

[The format of the issue] Paper name/title: Paper link: Code link:

amusi updated 2 months ago
83
facebookresearch/segment-anything #6

predictor with text prompt using CLIP

Hi, I have implemented text prompt-controlled segmentation using selective search and CLIP. Can you suggest any additional techniques I can include? I am considering trying CLIP-GradCAM #4 h…

Usama3059 updated 1 year ago
18
XpressAI/xai-llm-server #2

Feature Request: Add support for Llama-3.2-11B-vision/

### Problem We want to add support for this new model that unlike the previous ones also supports vision. The readme for the model is described below: --- language: - en - de - fr - it - pt…

wmeddie updated 2 months ago
3
googleapis/python-aiplatform #3775

Including Tools prevents Gemini from providing a natural lan…

#### Environment details - OS type and version: Mac OS, Python #### Steps to reproduce 1. Include a Tool in the GenerativeModel.generate_content call. Don't specify any System Instruction…

expresspotato updated 1 month ago
14
