Closed Dratlan closed 4 months ago
After setting the os.environ['CUDA_LAUNCH_BLOCKING'] = '1', the erros change to:
Traceback (most recent call last):
File "/workspace/cpfs-data/code/video_transfer/StyleGANEX/scripts/train.py", line 32, in
It seems to be solve after i change a new cuda+pytorch version. now it is cuda11.6+pytorch1.13+cudnn8
Sorry to bother you again, I met a bug . I'm training video toonifiy in a single A100 GPU, as the 1th iter calculate the discriminator loss, this error shows. One thing I noticed is that the GPU memory used reaches 62.2GB/80GB at the peak, and then the error shows up. Can you give me some suggestions?