MzeroMiko / VMamba

VMamba: Visual State Space Models,code is based on mamba
MIT License
2.16k stars 134 forks source link

GPU out of memory! #141

Open wsasdsda opened 6 months ago

wsasdsda commented 6 months ago

Hi, thanks for your contribution! When I try to finetune Vmamba on our own dataset (we use vssm_base_224 and load the vssmbase_dp06_ckpt_epoch_241.pth). Surprisingly, our GPU took up a lot of memory despite the Batchsize being set to 4, which made our training impossible on RTX3090 (24GB Memory). Could you please offer any solutions?

MzeroMiko commented 6 months ago

You can try VMambav2 base or even small, tiny model to save memory and FLOPs.