Closed LieuMai closed 2 years ago
Thanks for your interest in our project!
The "CUDA out of memory" error occurs because 2 GB of GPU memory is too small to train the model.
Our model was trained on 8 V100 (32 GB) GPUs. You could try a GPU with more memory (e.g., 16 GB or larger) and set "--max-sentences=1" to save memory.
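As a rough illustration of the memory check described above, here is a minimal Python sketch that reads the "Memory-Usage" cell from plain `nvidia-smi` output and compares the total against the suggested size. The function names and the threshold constant are hypothetical and not part of this project. The threshold is set a little below 16 GiB on the assumption that `nvidia-smi` can report slightly less than a card's nominal capacity.

```python
import re

# Assumption: roughly the "16GB or larger" suggestion from the reply above,
# set slightly below 16 * 1024 MiB because nvidia-smi can report a bit less
# than the nominal capacity. This is not a measured requirement of the model.
SUGGESTED_MIN_MIB = 15000


def total_memory_mib(smi_output):
    """Return the total memory (in MiB) of each GPU found in nvidia-smi text."""
    # Matches the "Memory-Usage" cell, e.g. "4MiB / 2048MiB".
    pairs = re.findall(r"(\d+)MiB\s*/\s*(\d+)MiB", smi_output)
    return [int(total) for _used, total in pairs]


def meets_suggestion(total_mib, minimum_mib=SUGGESTED_MIN_MIB):
    """Return True if the GPU has at least the suggested amount of memory."""
    return total_mib >= minimum_mib


# The GPU row from the report in this issue.
sample = "| N/A 46C P8 N/A / N/A | 4MiB / 2048MiB | 0% Default |"
for total in total_memory_mib(sample):
    print(total, meets_suggestion(total))
```

For the GPU in this report, the check finds 2048 MiB, which is well below the suggested size, so lowering the batch size alone is unlikely to be enough.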
Describe the bug
When trying to train the model, I got this error. I have searched related issues but could not find the help I needed.
Thank you in advance for any insights you can give.
Reproduction
sh seqco_scripts/train_cnndm.sh
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 510.85.02    Driver Version: 510.85.02    CUDA Version: 11.6     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ...  Off  | 00000000:01:00.0 Off |                  N/A |
| N/A   46C    P8    N/A /  N/A |      4MiB /  2048MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      1259      G   /usr/lib/xorg/Xorg                  4MiB |
+-----------------------------------------------------------------------------+