cswry / SeeSR

[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Apache License 2.0
390 stars 25 forks source link

OOM error happend when using accelerate but training works fine for single GPU #39

Open Synapsess opened 5 months ago

Synapsess commented 5 months ago

感谢你精彩的工作。我尝试微调SeeSR。由于显存限制,我将图片大小缩小为256×256。在单卡上使用 python train_seesr.py ##省略参数 会占据23GB的显存,可以在4090上运行。但当我尝试多卡 CUDA_VISIBLE_DEVICES="0,1" accelerate launch train_seesr.py ##省略参数 时总是发生OOM错误。我已经仔细检查输入tensor的形状,确保与单GPU时一致,但是找不到原因。感谢您的帮助!