As you described in your paper: "Analogously, inference, in order to evaluate our results for zero-shot video object segmentation , the masks provided for the first frame in one-shot VOS are used to select which predicted instances are selected for evaluation", it is easy to do so in val set, but gt masks for test-dev are not given. How can you get the results for test-dev? Looking forward to your reply.
Thanks.
As you described in your paper: "Analogously, inference, in order to evaluate our results for zero-shot video object segmentation , the masks provided for the first frame in one-shot VOS are used to select which predicted instances are selected for evaluation", it is easy to do so in val set, but gt masks for test-dev are not given. How can you get the results for test-dev? Looking forward to your reply. Thanks.