youcook2 Search Results

118 results
for youcook2

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

zjr2000/GVL #4

batch_size setting

I wonder why batch_size is set to 1, does the bigger batch_size cause the worse results?

wenjiajia123 updated 1 year ago
2
uhhyunjoo/paper-notes #2

[ICCV 2019] HowTo100M: Learning a Text-Video Embedding by Wa…

||link| |----|---| |paper| [HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips](https://openaccess.thecvf.com/content_ICCV_2019/papers/Miech_HowTo100M_Learni…

uhhyunjoo updated 2 years ago
5
microsoft/UniVL #46

Zero score (every output is None) on evaluation captioning w…

I tried to run and eval captioning with your pretrained model and YouCookII dataset, but met this issue. I follow the instructions, but every hyp on eval_epoch is none and every scores is 0.0 (do not…

Borntowarn updated 1 year ago
1
microsoft/SwinBERT #6

When will you release the tutorial for frame-based TSV gener…

Besides, how can we prepare the data files like *.label.tsv / *.caption.tsv / *.caption.linelist.tsv to train SwinBert on our own dataset? Thank you very much ~

yaolinli updated 7 months ago
12
microsoft/SwinBERT #38

Train MSVD dataset using VATEX pretrained model? Thanks

Hi, I am going to reproduce the reported performance on MSVD dataset with CIDEr of 120.6, but there exists a gap. In my experiment, the first evaluation after the initialization is poor, the initializ…

franciszchen updated 1 year ago
1
microsoft/UniVL #40

How to only input text feature or video feature

I want to only input text feature or video feature in UniVL. In this paper, it said that one transformer combines text representation **T** and video representation **V**. Could you tell me how to cha…

tingchihc updated 1 year ago
2
OpenGVLab/Ask-Anything #250

Videochat 2 IT dataset size

I know that's a long shot, but has anyone downloaded the whole dataset and can tell me how much GB/TB I can expect it to be? Thank you

joslefaure updated 4 days ago
2
gyxxyg/VTG-LLM #25

The result for ActivityNet

Hi, Thank you for sharing your impressive work! Equipping LLMs with temporal understanding is indeed a challenging task. I have a question regarding the ActivityNet results: Are the scores you r…

weiyuan-c updated 1 month ago
4
microsoft/UniVL #27

TypeError: bad operand type for unary -: 'list'

Hello, there is an error at '-x' in the following code, is it a problem with the numpy version? ``` import numpy as np def compute_metrics(x): sx = np.sort(-x, axis=1) d = np.diag(-x)…

jxrloveyou updated 1 year ago
6
simon-ging/coot-videotext #51

Feature representations for external videos

Hi, thank you for sharing your work and congratulations on the paper! I am trying to use COOT to create video descriptions for videos that aren't in ActivityNet. I saw your [comment ](https://githu…

arelhossan updated 2 years ago
1

上一页 1...1 2 3 4 5 6 7...12 下一页

118 results for youcook2

118 results
for youcook2