Open lxtGH opened 2 years ago
Hi! We are big fans of your work. Could you help us to better understand your work? @mcahny @aquariusjay Thanks a lot !!!!!
Hi, thanks for asking.
Thanks.
Thanks for your reply!! Dr.Dahun @mcahny As I prepare to re-implement your TubeFormer using Pytorch(mmdet). I want to know the details of mask based tracking part. Did you use ViP-like mask based tracking in off-line manner? Looking for you reply!!!
We have several detailed questions since we cannot find the code.
1.For VSS task, Is the global memory the prediction kernel of last convolution? Did you use bipartite matching?
2.For VPS task on KITTI-STEP, in the section of “Global memory with split thing and stuff.” Did you use bipartite matching for stuff memory or directly use Cross Entropy Loss?
3.For both VPS and DVPS tasks, we are also confusing on the prediction label range in the section of “Global memory with split thing and stuff.” Dose the mask classification for thing and stuff is performed jointly or individually? (joint classification head for C{thing}+C{stuff} or two heads for C{thing} and C{stuff} to hand each.)
4.For DVPS task, how did you handle the un-labeled region on KITTI-DVPS since the labels are very sparse?
5,Will the code be released for reference? Thanks a lot!!!!!