Open join-ville opened 2 years ago
Hi, have you tried training multiple times? 8.53 is not a typical result. If you keep getting results like that, something may be wrong with your setup, for example the versions of some libraries or your environment.
We only use flip and random resize.
I trained the COCO+LVIS SegFormerB0-S1 model using trainval_scripts/train_focalclickB0_S1_cclvs.sh without any changes. The validation result is as follows:
Compared with the published model:
I noticed that the batch size is 32 in the paper but 64 in the bash script. Could that be the reason for the accuracy gap?
Also, the code contains two data augmentation pipelines, while the paper says "During training, we only use flip and random resize with the scale from 0.75 to 1.4 as data augmentation." Which one was used for the published model?
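To make the question concrete, this is roughly how I read the augmentation described in the paper. It is only a minimal sketch, not the code from this repository: the function name `flip_and_random_resize`, the horizontal flip direction, the 0.5 flip probability, and the nearest-neighbour resizing are all my own assumptions. Only the 0.75 to 1.4 scale range comes from the paper.

```python
import numpy as np


def flip_and_random_resize(image, mask, rng, scale_range=(0.75, 1.4), flip_prob=0.5):
    """Hypothetical helper: random horizontal flip plus random rescale.

    image: H x W x C array, mask: H x W array. The flip direction, flip_prob,
    and nearest-neighbour sampling are assumptions, not taken from the paper.
    """
    # Flip image and mask together so they stay aligned.
    if rng.random() < flip_prob:
        image = image[:, ::-1]
        mask = mask[:, ::-1]

    # Draw one scale factor from the range quoted in the paper.
    scale = rng.uniform(*scale_range)
    h, w = mask.shape[:2]
    new_h = max(1, int(round(h * scale)))
    new_w = max(1, int(round(w * scale)))

    # Nearest-neighbour resize: map each output row/column back to a source index.
    rows = np.minimum((np.arange(new_h) / scale).astype(int), h - 1)
    cols = np.minimum((np.arange(new_w) / scale).astype(int), w - 1)
    image = image[rows][:, cols]
    mask = mask[rows][:, cols]
    return image, mask, scale
```

Calling it with `rng = np.random.default_rng(0)` on an image and its mask returns the augmented pair plus the sampled scale. If the published model was trained with the other pipeline in the code, that would be useful to know.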