Closed by z-jiaming 1 year ago
Hi Jiaming,
Unfortunately, I don't have the capacity to properly release all the baselines, but here's a dump of my local XMem repo: https://drive.google.com/file/d/1IcdQOieXBa2LNtn1WYiMR5fdGC82B2ya/view?usp=sharing. Hope it helps!
This is a great help, thank you so much!
I looked at the code and found that load_sub_ours loads the VOST training set from balanced_train.txt, which is not provided with the original dataset. We would like to ask whether you modified the training set when training the XMem_baseline, for example by deleting some videos.
Thanks again for your help!
Sorry for the confusion. This must be an artifact from when I was constructing the splits. The reported numbers were obtained with the final split, which is released with the dataset.
Thanks for your reply!
Sorry, I have one more question. In aot_plus you 1) use merge_sample and 2) filter out samples above ignore_thresh (https://github.com/TRI-ML/VOST/blob/398bbc2ee5dcd6eef3a508532f7ac1e2962df601/aot_plus/dataloaders/train_datasets.py#L340), but the XMem code you shared does neither. Did you try them with XMem?
Thank you very much for your reply and congratulations again on your work!!!
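For readers following the thread, here is a minimal sketch of the kind of filtering referred to in 2). It is not taken from the VOST or XMem code. It assumes that ignore_thresh is an upper bound on the fraction of pixels carrying an ignore label (taken to be 255 here) and that a sample is dropped when any of its masks exceeds that bound. The names `ignore_fraction` and `keep_sample`, the label value, and the threshold are all hypothetical.

```python
# Hypothetical sketch: reject a training sample when too much of any mask
# is marked "ignore". Masks are plain nested lists of integer labels.

IGNORE_LABEL = 255  # assumed value for "ignore" pixels, not confirmed by the thread


def ignore_fraction(mask, ignore_label=IGNORE_LABEL):
    """Return the fraction of pixels in `mask` equal to `ignore_label`."""
    total = sum(len(row) for row in mask)
    if total == 0:
        return 0.0
    ignored = sum(1 for row in mask for px in row if px == ignore_label)
    return ignored / total


def keep_sample(masks, ignore_thresh=0.2, ignore_label=IGNORE_LABEL):
    """Keep the sample only if every mask stays at or below `ignore_thresh`."""
    return all(ignore_fraction(m, ignore_label) <= ignore_thresh for m in masks)


if __name__ == "__main__":
    clean = [[0, 1], [1, 0]]
    noisy = [[255, 255], [1, 0]]
    print(keep_sample([clean]))         # no ignore pixels in the sample
    print(keep_sample([clean, noisy]))  # one mask is half ignore pixels
```

Under these assumptions, a sampler would redraw frames whenever `keep_sample` returns False. Whether the aot_plus loader behaves this way should be checked against the linked line.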
Thanks for your nice work!
Could you please share the code you used to train XMem? It would be a great help if you could!