gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
MIT License

Questions on refinement with SAM #116

Open kuaitoukid opened 1 year ago

kuaitoukid commented 1 year ago

In your paper, it is written that "Therefore, we utilize SAM to refine the masks predicted by XMem when its quality assessment is not satisfactory. Specifically, we project the probes and affinities to be point prompts for SAM, and the predicted mask from Step 2 is used as a mask prompt for SAM. Then, with these prompts, SAM is able to produce a refined segmentation mask."

My question is: is the quality assessment done automatically or by a human? I ask because I haven't found any use of sam_refinement in the function vos_tracking_video.
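To make the question concrete, here is a minimal sketch of what an automatic check might look like. It is only an illustration: `mask_quality`, `track_frame`, `refine_fn`, and the 0.8 threshold are hypothetical names and values, not taken from the repository or the paper, and the confidence score is an assumed stand-in for whatever quality assessment the paper refers to.

```python
import numpy as np

def mask_quality(prob):
    """Hypothetical quality score in [0, 1]: mean per-pixel confidence.

    A probability near 0 or 1 counts as confident, near 0.5 as uncertain.
    """
    prob = np.asarray(prob, dtype=np.float64)
    return float(np.mean(np.abs(2.0 * prob - 1.0)))

def track_frame(prob, refine_fn, threshold=0.8):
    """Binarize the tracker's probability map and refine it only if needed.

    prob      : foreground probability map from the tracker (assumed input)
    refine_fn : placeholder for a refinement step such as a SAM call
    threshold : assumed cut-off below which the mask is sent to refine_fn
    Returns (mask, was_refined).
    """
    mask = np.asarray(prob) > 0.5
    if mask_quality(prob) < threshold:
        return refine_fn(mask), True
    return mask, False
```

With something like this, refinement would be triggered per frame without any human input. If the assessment is manual instead, I would expect the user to trigger the refinement step from the interface.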

Thank you for your open-source code. I look forward to your reply!

CuriousTank commented 11 months ago

Same question

baoyi-A commented 9 months ago

Same question

xin-shinzheng commented 9 months ago

Same question