Hi, @Alxx999! Thanks for your interest in our work.
fpn_levels and gt_inds indicate the corresponding FPN feature level and ground truth for each predicted instance, respectively. You can try running CondInst to see the details. Since we sample a pair of frames from one video in a batch, we use 0 and 1 to identify the two frames.
inst_im_inds and inst_rel_ids are used to find the instance relations between a pair of frames.
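
To make the idea concrete, here is a minimal sketch in plain Python, not the code from our repository. It assumes that inst_im_inds holds the frame index (0 or 1) of each predicted instance and that inst_rel_ids holds an id shared by predictions of the same object in both frames; that reading of inst_rel_ids, the helper name pair_instances, and the toy values are all assumptions made for illustration.

```python
from collections import defaultdict


def pair_instances(inst_im_inds, inst_rel_ids):
    """Group instance positions by relation id, split by frame index (0 or 1).

    Hypothetical helper: assumes inst_rel_ids are ids shared across the two frames.
    """
    by_id = defaultdict(lambda: ([], []))
    for pos, (frame, rel_id) in enumerate(zip(inst_im_inds, inst_rel_ids)):
        by_id[rel_id][frame].append(pos)
    # Keep only ids that have at least one instance in each of the two frames.
    return {rid: frames for rid, frames in by_id.items() if frames[0] and frames[1]}


# Toy values, not taken from the repository.
inst_im_inds = [0, 0, 0, 1, 1]
inst_rel_ids = [7, 7, 9, 7, 3]
pairs = pair_instances(inst_im_inds, inst_rel_ids)
```

Each entry of pairs maps an id to the positions of its instances in frame 0 and frame 1, so ids that show up in only one frame are dropped.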