hustvl / YOLOS

[NeurIPS 2021] You Only Look at One Sequence
https://arxiv.org/abs/2106.00666
MIT License
827 stars 118 forks source link

How much GPU memory does the training need? #5

Closed LuckyDC closed 3 years ago

LuckyDC commented 3 years ago

How much GPU memory does the training need? It always raise out-of-memory error for me.

Yuxin-CV commented 3 years ago

Hi @LuckyDC, thanks for your interest in YOLOS.

We train all YOLOS models on 1 node with 8 x 11G 2080Ti GPUs.

Recently, we have access to an 8 x 24G 3090 GPUs node, we find it takes ~6 days to train 150 epochs for YOLOS-Small.

LuckyDC commented 3 years ago

Thanks for your reply. I will try gradient checkpointing~