Closed rohit901 closed 1 year ago
@Went-Liang, could you please point me in the right direction on how to resolve this issue? The training seems to work fine for the first stage (first 12k iters, the moment iter = 12k, I got this error)
Starting the VOS training stage from later iters [more than 12k] seemed to work. I guess the model prediction was not stable enough and hence it was not working when the start iter was 12k with COCO.
@rohit901 Did you solve it, please?
@YH-2023 yes, just increase the start_iters of VOS training. I think default is 12k iters, increase it and increase total iters for training and it should work.
@rohit901 What is the start_iters you set? What about metrics tested on the coco dataset with VOS?
Hi, i'm trying to train this model on COCO dataset for 80k iters, keeping all the other parameters and config the same. The second stage VOS training starts from 12k iters as before.
However, i'm getting this error:
The issue seems to be in the below highlighted code block in
roihead_gmm.py
Not sure why I'm getting nan values, is it because of data pre-processing? I'm using coco_2017_train dataset for train instead of the earlier voc_custom_train. All other config params are the same.