-
### Project Name
VidSage
### Description
# VidSage: Video Insights using Graph RAG
https://www.youtube.com/watch?v=IUSCWtB9jWk
VidSage focuses on processing video data, storing it in Azur…
-
We evaluated VideoClip on the video-text retrieval task using the COIN dataset, but the performance is much lower than the reported VideoQA performance (26%
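For reference, here is a minimal sketch of how recall@K and median rank are typically computed from a text-video similarity matrix; this is a generic implementation, not the repository's evaluation code, and the variable names are assumptions:

```python
import numpy as np

def retrieval_metrics(sim_matrix: np.ndarray) -> dict:
    """Text-to-video retrieval metrics from a (num_texts, num_videos) similarity
    matrix where the ground-truth pairing lies on the diagonal."""
    ranks = []
    for i, row in enumerate(sim_matrix):
        order = np.argsort(-row)                  # indices sorted by descending similarity
        ranks.append(int(np.where(order == i)[0][0]))  # 0-based rank of the true video
    ranks = np.asarray(ranks)
    return {
        "R@1": float(np.mean(ranks < 1) * 100),
        "R@5": float(np.mean(ranks < 5) * 100),
        "R@10": float(np.mean(ranks < 10) * 100),
        "MedR": float(np.median(ranks) + 1),      # 1-based median rank
    }

# Example: sim_matrix = text_embeddings @ video_embeddings.T (both L2-normalised)
```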
-
|       | link |
|-------|------|
| paper | [CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval](https://arxiv.org/pdf/2104.08860v2.pdf) |
| code  | [papers with code](https://paperswithcode.com…
-
In [CMIN_moment_retrieval/dataloaders/clip_loader.py line 66](https://github.com/ChenyunWu/CMIN_moment_retrieval/blob/df44a230a0cd83d9ab3e282601da60cbca56a102/dataloaders/clip_loader.py#L66)
`if lab…
-
Could be fun to have a tokenizer like "take all video frames, apply CLIP, map each frame to one of the 2^17 clusters (what I have in the clip-retrieval index), apply BPE, return the sequence" (rough sketch below).
Inspired by https://arx…
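A rough, self-contained sketch of that tokenizer idea, with random arrays standing in for real CLIP frame features and a small centroid set standing in for the 2^17 clusters (nothing here comes from an existing index or codebase):

```python
import numpy as np

def frames_to_cluster_ids(frame_embeddings: np.ndarray,
                          centroids: np.ndarray) -> np.ndarray:
    """Assign every frame embedding to its nearest centroid by cosine similarity."""
    f = frame_embeddings / np.linalg.norm(frame_embeddings, axis=1, keepdims=True)
    c = centroids / np.linalg.norm(centroids, axis=1, keepdims=True)
    return np.argmax(f @ c.T, axis=1)            # one cluster id per frame

def cluster_ids_to_symbols(ids: np.ndarray) -> str:
    """Turn cluster ids into a whitespace-separated 'sentence' so an off-the-shelf
    BPE model (e.g. sentencepiece) could be trained on and applied to it."""
    return " ".join(f"c{int(i)}" for i in ids)

# Toy usage: random data in place of CLIP features; 1024 centroids stand in
# for the full 2^17-cluster index.
rng = np.random.default_rng(0)
frames = rng.normal(size=(32, 512)).astype(np.float32)       # 32 sampled frames
centroids = rng.normal(size=(1024, 512)).astype(np.float32)
ids = frames_to_cluster_ids(frames, centroids)
print(cluster_ids_to_symbols(ids)[:80], "...")
```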
-
I am a student in Toronto learning about multimodal models and multimodal retrieval.
Can embeddings be extracted from your models?
I would like to compare retrieval results from your model to CLIP.…
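For the CLIP baseline side of that comparison, here is roughly how I would pull embeddings with the Hugging Face `transformers` CLIP implementation (the image path and prompts are just placeholders, and this says nothing about your model's API):

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("frame.jpg")                  # placeholder: a sampled video frame
texts = ["a person cooking", "a soccer match"]   # placeholder queries

inputs = processor(text=texts, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    img_emb = model.get_image_features(pixel_values=inputs["pixel_values"])
    txt_emb = model.get_text_features(input_ids=inputs["input_ids"],
                                      attention_mask=inputs["attention_mask"])

# L2-normalise before computing cosine similarities for retrieval
img_emb = img_emb / img_emb.norm(dim=-1, keepdim=True)
txt_emb = txt_emb / txt_emb.norm(dim=-1, keepdim=True)
print(img_emb @ txt_emb.T)                       # similarity of the frame to each text
```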
-
Hey - I am unable to reproduce the reported zero-shot results. So far I have tried MSRVTT and MSVD; I would appreciate it if you could kindly take a look.
Here is what I got after running these 2 script…
-
Are the weights of the original CLIP layers always kept frozen during the whole training process?
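In case it helps, a quick way to check this empirically in a PyTorch checkout (the `model.clip` attribute below is only a guess at where the backbone lives, not this repository's actual structure):

```python
import torch

def report_frozen(module: torch.nn.Module, prefix: str = "") -> None:
    """Print how many parameters are trainable vs. frozen (requires_grad)."""
    frozen, trainable = 0, 0
    for _, p in module.named_parameters():
        if p.requires_grad:
            trainable += p.numel()
        else:
            frozen += p.numel()
    print(f"{prefix} trainable={trainable:,} frozen={frozen:,}")

# Hypothetical usage, with `model.clip` standing in for the wrapped CLIP backbone:
# report_frozen(model.clip, prefix="CLIP backbone:")

# Explicitly freezing the backbone would look like:
# for p in model.clip.parameters():
#     p.requires_grad = False
```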
-
Hello, I have run the training and embedding extraction, and I'm wondering how I can see examples of the text that the model retrieved.
The embeddings and h5 files seem to be mostly numeric -- How do …
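In case it is useful to others, here is a generic sketch of how retrieved captions could be inspected from such files; the h5 dataset names and the captions file are assumptions, not necessarily what the extraction script writes:

```python
import h5py
import numpy as np

# Assumed layout: substitute whatever names the extraction step actually produced.
with h5py.File("embeddings.h5", "r") as f:
    video_emb = np.asarray(f["video_embeddings"])   # (num_videos, dim)
    text_emb = np.asarray(f["text_embeddings"])     # (num_texts, dim)

with open("captions.txt") as f:
    captions = [line.strip() for line in f]         # one caption per text embedding

# Normalise and rank captions for one query video
video_emb /= np.linalg.norm(video_emb, axis=1, keepdims=True)
text_emb /= np.linalg.norm(text_emb, axis=1, keepdims=True)

query = 0                                           # index of the video to inspect
scores = text_emb @ video_emb[query]
for rank, idx in enumerate(np.argsort(-scores)[:5], start=1):
    print(f"{rank}. ({scores[idx]:.3f}) {captions[idx]}")
```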
-
First of all congrats on the paper and thanks for providing the code!
In the paper, under 'Zero-shot language-based multi-modal joint retrieval', you mention that integrating/combining multiple embeddin…
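In case the question is about a weighted fusion of per-modality similarities, a generic sketch of that pattern might look like the following (the modality names and weights are assumptions, not the paper's actual formulation):

```python
import numpy as np

def fuse_similarities(sims: dict, weights: dict) -> np.ndarray:
    """Weighted sum of per-modality similarity matrices, all shaped
    (num_queries, num_items); weights are renormalised to sum to 1."""
    total = sum(weights.values())
    fused = np.zeros_like(next(iter(sims.values())))
    for name, s in sims.items():
        fused += (weights[name] / total) * s
    return fused

# Hypothetical usage with similarity matrices computed separately per modality:
# fused = fuse_similarities(
#     {"video": text_emb @ video_emb.T, "audio": text_emb @ audio_emb.T},
#     {"video": 0.7, "audio": 0.3},
# )
```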