Training time - Githubissues

juhongm999 / hsnet

Official PyTorch Implementation of Hypercorrelation Squeeze for Few-Shot Segmentation, ICCV 2021


Thanks for your interest in our research. We follow the standard early-stopping procedure to choose the best-performing model. By the best model, we mean the model at the epoch where the validation mIoU curve starts to saturate. Since we normally do not know in advance when that epoch will occur, one typically sets the number of iterations effectively unbounded (niter=2000 in our case), keeps an eye on the training process, and picks the best model based on validation performance. This is what we did in our experiments.
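
For readers who would rather automate the "keep an eye on it" step, here is a minimal sketch of patience-based early stopping over a list of per-epoch validation mIoU values. It is not the procedure used in this repository, where the curve was monitored by hand. The function name `pick_best_epoch` and the `patience` and `min_delta` parameters are assumptions made for illustration.

```python
def pick_best_epoch(val_miou_history, patience=50, min_delta=0.0):
    """Scan per-epoch validation mIoU and return (best_epoch, best_miou, stop_epoch).

    Hypothetical helper, not part of hsnet. `patience` is the number of epochs
    without improvement after which training would be stopped; `min_delta` is
    the smallest increase that counts as an improvement.
    """
    best_epoch, best_miou = -1, float("-inf")
    for epoch, miou in enumerate(val_miou_history):
        if miou > best_miou + min_delta:
            # New best validation score: remember where it happened.
            best_epoch, best_miou = epoch, miou
        elif epoch - best_epoch >= patience:
            # The curve has been flat (or falling) for `patience` epochs.
            return best_epoch, best_miou, epoch
    # Never ran out of patience: the last epoch is the stopping point.
    return best_epoch, best_miou, len(val_miou_history) - 1


history = [0.40, 0.50, 0.55, 0.54, 0.55, 0.53, 0.52]
best_epoch, best_miou, stop_epoch = pick_best_epoch(history, patience=3)
```

In a real training loop the same comparison would run once per epoch, with a checkpoint saved whenever a new best is found, so the model from `best_epoch` can be recovered after stopping.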

Validation takes much longer than training because nworker for the validation dataloader is set to 0, whereas it is set to 8 for training. We do this to reproduce exactly the same results as in our paper by removing stochasticity in sampling support examples. Note that there are ways to get around this slow validation while keeping reproducibility, by dynamically setting the random seed in the dataloader with nworker > 1.
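
One possible way to do this, sketched below in plain Python, is to derive the random seed from the query index, so the support set chosen for a query does not depend on which worker process loads it or in what order. This is an illustration of the idea only, not code from this repository. The name `sample_support_indices`, its parameters, and the seed-mixing constant are all hypothetical. It makes validation episodes repeatable from run to run, but it would not necessarily reproduce the exact episodes drawn by the single-process setup used for the paper.

```python
import random


def sample_support_indices(query_idx, candidate_indices, shot, base_seed=0):
    """Deterministically pick `shot` support examples for one query.

    Hypothetical helper: a dataset's __getitem__ could call this instead of
    drawing from a shared global RNG, whose state differs across workers.
    """
    # A private RNG seeded from (base_seed, query_idx) gives the same draw
    # regardless of how many workers exist or which one handles this query.
    rng = random.Random(base_seed * 1_000_003 + query_idx)
    # Never use the query itself as a support example.
    pool = [i for i in candidate_indices if i != query_idx]
    return rng.sample(pool, shot)


same_class = list(range(20))  # assumed: indices of images sharing the query's class
first = sample_support_indices(7, same_class, shot=5)
second = sample_support_indices(7, same_class, shot=5)
```

Because the draw is a pure function of the query index and `base_seed`, `first` and `second` are identical, and the validation dataloader can use several workers without changing which support examples each query receives.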

Training time #12