Now, my data has video frames + segmentation mask for each frame. My target: creating my dataset to evaluate Cotracker in my case study.
Can you give me dataset structure example?
On github you mentioned download dataset "TAP-Vid". Can you explain more?
Or you can send me of "DAVIS First" test set that you used. My email: ah23975@bristol.ac.uk
As I understand: inputs are Frames + segmentation mask + raft masks and the label (annotation) is list points track in every frame. Do I missing some thing? Please let me know your pre-processing.
Now, my data has video frames + segmentation mask for each frame. My target: creating my dataset to evaluate Cotracker in my case study. Can you give me dataset structure example?
Thanks Dat