I'm not sure how the fine-tuning is performed for CVC. As far as I understand from other issues, you feed the network with the entire 25-frames sequence. Then, do you obtain a segmentation for every of these frames? Or do you get one unified segmentation for the entire sequence?
Hi, thanks for your nice work!
I'm not sure how the fine-tuning is performed for CVC. As far as I understand from other issues, you feed the network with the entire 25-frames sequence. Then, do you obtain a segmentation for every of these frames? Or do you get one unified segmentation for the entire sequence?