-
I'm curious about how to do cross-modal retrieval with the YouTube-8M dataset. I have videos with image and audio data, and would like to learn two encoders that embed both audio and RGB data into the…
-
Thanks for the great work.
I evaluate the zero-shot performance of the 25M pre-trained ckpt on the DiDeMo dataset, my command is
```
export VL_DATA_DIR=/home/renshuhuai/VindLU/
export VL_EXP_DIR…
-
I am bit confused about the final result by running the main.py when inferencing. Are they just a couple of feature embeddings for the input images?
-
## AAAI-24
Benchmarking Large Language Models in Retrieval-Augmented Generation
https://arxiv.org/abs/2309.01431
Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Langua…
-
## Feature Request
**Is your feature request related to a problem or unsupported use case? Please describe.**
Make accessible the completion threshold of a video to the frontend. This metric was s…
-
### Project name
VideoQuery.ai
### Description
This is a Simple Chat Application that allows users to search youtube videos and give the URL to the chat app and then chat with AI ChatBot rega…
-
Hi.
I read your "Dual Encoding for Video Retrieval by Text" which accepted by TPAMI2021. But the code repository(https://github.com/danieljf24/hybrid_space) mentioned in the paper is not existed an…
-
```
Add video metadata writer/retriever, would be really useful for android where
there is a high API level requirement for metadata retrieval, also
FFmpegFrameRecorder could have an option to write…
-
```
Add video metadata writer/retriever, would be really useful for android where
there is a high API level requirement for metadata retrieval, also
FFmpegFrameRecorder could have an option to write…
-
```
Add video metadata writer/retriever, would be really useful for android where
there is a high API level requirement for metadata retrieval, also
FFmpegFrameRecorder could have an option to write…