GPU out of memory！ - Githubissues

MzeroMiko / VMamba

VMamba: Visual State Space Models，code is based on mamba

MIT License

2.16k stars 134 forks source link

GPU out of memory！ #141

Open wsasdsda opened 6 months ago

wsasdsda commented 6 months ago

Hi, thanks for your contribution! When I try to finetune Vmamba on our own dataset （we use vssm_base_224 and load the vssmbase_dp06_ckpt_epoch_241.pth). Surprisingly, our GPU took up a lot of memory despite the Batchsize being set to 4, which made our training impossible on RTX3090 (24GB Memory). Could you please offer any solutions？

MzeroMiko commented 6 months ago

You can try VMambav2 base or even small, tiny model to save memory and FLOPs.