med-air / Endo-FM

[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train
Apache License 2.0

CVC training #22

Open jmorlana opened 4 months ago

jmorlana commented 4 months ago

Hi, thanks for your nice work!

I'm not sure how the fine-tuning is performed for CVC. As far as I understand from other issues, you feed the network the entire 25-frame sequence. Do you then obtain a segmentation for each of these frames, or a single unified segmentation for the entire sequence?

Kyfafyd commented 4 months ago

Hi @jmorlana, thanks for your interest! We obtain a segmentation for each of the input frames.
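
To make the per-frame answer concrete, here is a toy sketch of what "one segmentation per input frame" means in terms of output shape. It is not code from the Endo-FM repository: the function name `segment_clip`, the hand-made score maps, and the 0.5 threshold are all assumptions for illustration, standing in for whatever the fine-tuned network actually predicts.

```python
# Hypothetical illustration only; not taken from the Endo-FM codebase.
from typing import List

Frame = List[List[float]]  # H x W grid of foreground scores in [0, 1]
Mask = List[List[int]]     # H x W grid of 0/1 labels


def segment_clip(clip_scores: List[Frame], threshold: float = 0.5) -> List[Mask]:
    """Threshold each frame's score map independently, returning one mask per frame."""
    return [
        [[1 if score > threshold else 0 for score in row] for row in frame]
        for frame in clip_scores
    ]


# Toy clip: 25 frames (the sequence length mentioned in the question), each 2x2.
NUM_FRAMES = 25
clip = [[[t / NUM_FRAMES, 0.9], [0.1, 0.6]] for t in range(NUM_FRAMES)]

masks = segment_clip(clip)
print(len(masks))  # number of masks equals number of input frames
```

A unified segmentation would instead collapse the clip into a single mask; the per-frame case keeps the temporal dimension, so the output list has the same length as the input clip.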