-
Hello, thank you for your work. I would like to ask why you think the task of synchronized subtitles is important. How can it help in action generation and action understanding?
-
Thank you for your great open-source code. I am excited about the outstanding zero-shot performance on video-text retrieval. Could you share the inference code for video-text retrieval on MSRVTT? thank…
-
ICCV 21
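In one sentence: for the video-text matching task, the paper considers both global and local features, and uses an efficient method to align the local features.
Previous methods mainly pull the video representation and the text representation closer together. The authors argue that this loses a lot of fine-grained information, so they also take local information into account. The authors split the video into several segments; each segment's representation serves as a local representation of the video, and all local representations are fused with max pooling to obtain the video's glob…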
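The fusion step described above (per-segment local features combined by max pooling) can be sketched in a few lines. This is a minimal illustration rather than the paper's implementation: the function name `max_pool_segments` and the toy feature values are assumptions, and in practice the segment features would come from a video encoder.

```python
from typing import List


def max_pool_segments(local_feats: List[List[float]]) -> List[float]:
    """Fuse per-segment feature vectors by taking the element-wise maximum."""
    if not local_feats:
        raise ValueError("need at least one segment")
    dim = len(local_feats[0])
    if any(len(vec) != dim for vec in local_feats):
        raise ValueError("all segment features must share one dimension")
    # zip(*local_feats) groups the values of each feature dimension across segments
    return [max(column) for column in zip(*local_feats)]


# Hypothetical toy features: three segments, four dimensions each
segments = [
    [0.1, 0.9, -0.3, 0.0],
    [0.4, 0.2, 0.5, -0.1],
    [-0.2, 0.7, 0.1, 0.3],
]
global_feat = max_pool_segments(segments)
print(global_feat)  # [0.4, 0.9, 0.5, 0.3]
```

Each output dimension keeps the strongest response from any segment, so the pooled vector has the same size regardless of how many segments the video was split into.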
-
||link|
|----|---|
|paper| [Disentangled Representation Learning for Text-Video Retrieval](https://arxiv.org/abs/2203.07111) |
|code| [papers with code](https://paperswithcode.com/paper/disentangle…
-
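I ran demo/demo.ipynb directly with the model https://huggingface.co/OpenGVLab/InternVideo2-Stage2_1B-224p-f4/blob/main/InternVideo2-stage2_1b-224p-f4.pt and found that the results are not very good.
First, two changes are needed to load the model correctly:
1. In demo/demo.ipynb, in setup_internvid…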
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
I am referring to this example: https://www.llamaindex.ai/blog/multimodal-rag-for-advanc…
-
### Data Owner Name
Fengwo Extraordinary
### Data Owner Country/Region
China
### Data Owner Industry
Resources, Agriculture & Fisheries
### Website
https://www.qcc.com/firm/97d395cae168248fb491…
-
### Version
1
### DataCap Applicant
Beijing Mipai Culture Media(北京觅拍文化传媒)
### Project ID
7
### Data Owner Name
Beijing Mipai Culture Media
### Data Owner Country/Region
China
### Data Owner …
-
Azure OpenAI is a service on the Azure platform for generative AI.
Here we can perform search.
It has REST APIs; we can
Dense Captions: For every item detected in the image, it can genera…
-
Implementation of the ScienceDirect [Object Retrieval API](https://dev.elsevier.com/documentation/ObjectRetrievalAPI.wadl). The description of the API states: These interfaces represent retrieval of …