text-video-retrieval Search Results

PKU-YuanGroup/LanguageBind #38

Combination of multiple modalities

First of all congrats on the paper and thanks for providing the code! In the paper at 'Zero-shot language-based multi-modal joint retrieval' you mention that integrating/combining multiple embeddin…

anthony-mendil updated 3 days ago

microsoft/UniVL #21

Joint loss in pretraining

Hi, we found that the video-text joint loss in pretraining is calculated from masked video and text. Why not use the original video and text, as in retrieval finetuning? https://github.com/microsoft/UniVL/blo…

zhangliang-04 updated 2 years ago

pandacrypto/DSPA-Allocator #8

[DataCap Application] <jcphysics>

### Version 1 ### DataCap Applicant Black He ### Project ID 7 ### Data Owner Name jcphysics ### Data Owner Country/Region Singapore ### Data Owner Industry Education & Training ### Website…

simida1911 updated 1 week ago

SCUTlihaoyu/open-chat-video-editor #11

AssertionError: Torch not compiled with CUDA enabled

╭─────────────────────────────── Traceback (most recent call last) ────────────────────────────────╮…

1893945 updated 1 year ago

OpenGVLab/unmasked_teacher #41

problems about zero_evaluation

Hello, Thanks for your great work! We'd like to run zero-shot evaluation on msrvtt qa task. However, following the readme below (set zero-shot evaluation and prepare dataset), we still encounter th…

Karine-Huang updated 3 months ago

TencentARC/MCQ #6

How to finetune on the MSRVTT

Hello, wonderful project! I wonder how to finetune the pre-trained models on downstream video-text retrieval datasets like MSR-VTT, LSMDC, and MSVD? I notice that the script for zero-shot retrie…

ForawardStar updated 2 years ago

OpenGVLab/unmasked_teacher #49

unable to reproduce zero-shot results

Hey - I am unable to reproduce the reported zero-shot results. So far I tried it on MSRVTT and MSVD, I would appreciate it if you kindly have a look. Here is what I got after running these 2 script…

pritamqu updated 2 days ago

PaddlePaddle/PaddleVideo #396

A little request about the application of T2VLAD.

Thanks for your extraordinary work of video-text retrieval with T2VLAD. Here, I have a little request about this work: could you share the other dataloaders, configs of MSR-VTT at 1k-A split, MSVD an…

YangYang updated 6 months ago

run-llama/llama_index #13943

[Question]: I want to build a local multimodal RAG chatbot, b…

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question ``` # This class will transform video to text and images class VideoProcessor: de…

LJ-Hao updated 15 hours ago

towhee-io/examples #235

How to free GPU memory

When I run the text-video retrieval function from a Python script, GPU memory usage grows with the number of calls (searches), and the kill script is released…

boatingMen updated 11 months ago

1000+ results for text-video-retrieval
