Closed StephenEkaputra closed 1 year ago
Try with 8 A100s. It will be a lot faster, and per-model-trained will be cheaper.
Hi, We did not train on V100 GPUs, so no idea about training time on these GPUs.
We trained on A100 GPUs and it takes about 1.5-2 days for training.
Thank you for your reply
Hi, I wonder how long does it take for you to train MobileViT-S with 8 GPUs? I trained your model MobileViT-S with 1024 batch size (128*8) for 1 epoch with 8 V100 GPUs, but the training time is very slow. It costs like 40 minutes/epoch. For 300 epochs, it means more than 8 days. Is it normal?
Thank you