Open SartisticV opened 8 months ago
Hi! I have a question regarding the code. Why is the decision made to sample from all video frames when the number of reference is greater than 10? I cant seem to find it in the paper.
https://github.com/SJTU-LuHe/TransVOD/blob/5a4464084b166e40680b8a071d9756f847876acc/datasets/vid_multi.py#L75-L76
In addition, why is the sampling strategy different during evaluation?
Hi! I have a question regarding the code. Why is the decision made to sample from all video frames when the number of reference is greater than 10? I cant seem to find it in the paper.
https://github.com/SJTU-LuHe/TransVOD/blob/5a4464084b166e40680b8a071d9756f847876acc/datasets/vid_multi.py#L75-L76