singhgautam / steve

Official code for Slot-Transformer for Videos (STEVE)
MIT License
41 stars 9 forks source link

Evaluation code? #2

Closed Wuziyi616 closed 1 year ago

Wuziyi616 commented 1 year ago

Hi, thanks for releasing this great work! I wonder if you also plan to release the evaluation code that computes the FG-ARI of predicted segmentation masks and the pre-trained weight on each dataset. I have one question regarding this metric:

singhgautam commented 1 year ago

Hi Ziyi,

Thank you for the encouraging comment!

Best, Gautam.

Wuziyi616 commented 1 year ago

Thank you for your prompt reply. Indeed, I also observe object splitting in SAVi (and, the model I'm currently working on :-)) when there are many unused slots. I'm looking forward to the dataset release as well.

Wuziyi616 commented 1 year ago

I apologize if I misunderstand the code, but according to these lines, it seems that you are using your own split of train-val-test, instead of the splits provided by MOVi authors?

Also, you use phase='full' to load the testing dataset. So it seems that you are reporting FG-ARI on all the videos in both train/val/test sets? Though I agree this may not be a big issue, since STEVE is an unsupervised segmentor, and it doesn't see the GT masks for the training set either.

singhgautam commented 1 year ago

All our metrics are computed on the official held-out sets. The 'full' applies to the data directory containing only the held-out videos.

singhgautam commented 1 year ago

I have now removed those lines to prevent confusion.

Wuziyi616 commented 1 year ago

I see, thanks, that's very clear now!