-
Hello, I am trying to evaluate VATT on YouCook2 dataset for text-video retrieval. I am having errors trying to load a previous checkpoint among many other package issues with tensorflow v2.7, DMVR, an…
-
您好,“A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval” 这个工作是一个十分有价值的工作。我仔细学习了您提供的实验代码。不过发现了一个小问题,这个问题可能会避免日后研究的一些异常情况。
您对于缺失场景的实现策略似乎是这样的:
(…
-
[](https://issuehunt.io/r/mtxr/vscode-sqltools/issues/110)
### Issue Type
* [ ] Bug
* [ ] Enhancement
* [X] Feature Request
* [ ] Question
* [ ] Other
### Prerequisites (For bugfixes)
…
-
您好,我按照您公开在github上的DCHUC代码在Flickr数据集上测试,请问一下论文中的实验结果是迭代多少轮的情况?
-
Hi~ Nice work! In my understanding, evaluation code split embedding features into mini-batches to calculate attention (cross-modal interactions), and splice them into a whole similarity matrix. The ab…
-
hi,大家好,非常高兴的告诉大家,百度飞桨论文复现赛第五期已经开始了,本次**论文复现赛**共将有100篇的经典&前沿论文供大家复现,以及新增了**工程落地赛**,详细信息可以参考[AI Studio](https://aistudio.baidu.com/aistudio/competition/detail/126/0/introduction),大家是否已经迫不及待了呢~
为了帮助…
-
Hi, I found your work to be very interesting.
But I am a bit confused about your loss functions.
You computed the i2t_loss and t2i_loss separately but aren't they the same?
Am i getting something w…
-
Hi, I went through the WebQA_train_val.json and found out of 41739 examples only 21465 has positive image ids? So is this normal or I did some mistake during the preprocessing?
-
It seems you did not include the cross-modal retrieval task in the paper for now, do you plan to add the experiments?
-
Title: Cross-Modal Center Loss for 3D Cross-Modal Retrieval
Paper Link: https://arxiv.org/abs/2008.03561
Code Link: https://github.com/LongLong-Jing/Cross-Modal-Center-Loss