-
In the Line 85 of [SwinBERT](https://github.com/microsoft/SwinBERT/tree/main/prepro)/create_image_frame_tsv.py.
" current_image_path = previous_image_path "
Does it mean when the amount of extra…
-
Hi, thanks for great code and amazing work
I do my best to to make similar performance but I get some trouble.
Could I get some of your advice?
1. Download dataset
Unfortunately, some of datas…
-
hi @crodriguezo,
Can you share some details about the training details such as how long it will take, what is hardware/GPU was used?
Currently, on A100-80GB (24 CPUs), the training is too slow …
-
Hi, I am going to reproduce the reported performance on MSVD dataset with CIDEr of 120.6, but there exists a gap. In my experiment, the first evaluation after the initialization is poor, the initializ…
-
Hi, guys! Thank you for the project a lot. But I have an issue with downloading pretrained models using download_models.sh. I've tied different networks, but it fails all the time. Do you have another…
-
Hi,
Thanks for the nice library. I found DALI while looking for a video loader for action recognition. I found that DALI yet cannot handle various resolution as in the issue #725 which is necessary f…
-
Hi, How do you get the videos for the youcook2 datasets, since they only provide annotations? Would I need to download each video from youtube? Or do you provide embeddings for the videos?
Thanks
-
bash eval.sh
启动脚本如下:
```
#!/bin/bash
DIR="VTG-LLM"
MODEL_DIR="/home1/lw/fyy/VTG-LLM/vtgllm.pth"
# TASK='dvc'
# ANNO_DIR='data/VTG-IT/dense_video_caption/Youcook2'
# VIDEO_DIR='data/youco…
-
Hi, thanks for sharing this great work.
I am trying to train univl and got negative loss. Is it okay? Have you ever observed this issue? I am using small batch (12). The default epoch was 1 in the …
-
Hi, thanks for your great work of VideoChat2!
I tried to organize the Ego4d dataset used in the paper. But I found that there are several splits for each video, and the split information is unavail…