Open CxuS opened 5 years ago
Hi @CxuS, I encourage you to work on Something-Something V2 because the video resolution is higher (minimum high/weight = 256). I have been using Something-Something v1 for this project and the object detection predictions made by MaskRCNN were not perfect compared to VLOG for example. You can use the torchvision pipeline for running maskrcnn on the videos. Thanks poiting out this mistake, you are right there are 174 I did a mistake in the paper. I will update this section.
Thank you for your reply and your advice. Best regards !
here have something questions. 1、I want to use this project on "Something-something", but i haven't this mask data. 2、The dataset of "something-something" with 174 classes in 20bn. but I find only 157 classes in your paper. so I don't know that whether "SS" in 20bn .