google-research / deeplab2

DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks.
Apache License 2.0
1.01k stars 159 forks source link

Several Questions on TubeFormer-CVPR-2022. #136

Open lxtGH opened 2 years ago

lxtGH commented 2 years ago

We have several detailed questions since we cannot find the code.

1.For VSS task, Is the global memory the prediction kernel of last convolution? Did you use bipartite matching?

2.For VPS task on KITTI-STEP, in the section of “Global memory with split thing and stuff.” Did you use bipartite matching for stuff memory or directly use Cross Entropy Loss?

3.For both VPS and DVPS tasks, we are also confusing on the prediction label range in the section of “Global memory with split thing and stuff.” Dose the mask classification for thing and stuff is performed jointly or individually? (joint classification head for C{thing}+C{stuff} or two heads for C{thing} and C{stuff} to hand each.)

4.For DVPS task, how did you handle the un-labeled region on KITTI-DVPS since the labels are very sparse?

5,Will the code be released for reference? Thanks a lot!!!!!

lxtGH commented 2 years ago

Hi! We are big fans of your work. Could you help us to better understand your work? @mcahny @aquariusjay Thanks a lot !!!!!

mcahny commented 2 years ago

Hi, thanks for asking.

Thanks.

lxtGH commented 2 years ago

Thanks for your reply!! Dr.Dahun @mcahny As I prepare to re-implement your TubeFormer using Pytorch(mmdet). I want to know the details of mask based tracking part. Did you use ViP-like mask based tracking in off-line manner? Looking for you reply!!!