dorarad / gansformer

Generative Adversarial Transformers
MIT License
1.33k stars 149 forks source link

CUDA_ERROR_OUT_OF_MEMORY #20

Open petergerten opened 3 years ago

petergerten commented 3 years ago

I always get out of memory errors even when using all defaults and training low resolution. 8 * V100 16GB

petergerten commented 3 years ago

Trying to train on 1 GPU I get stuck here:


  File "/usr/local/lib/python3.6/dist-packages/numpy/lib/shape_base.py", line 577, in expand_dims
    if axis > a.ndim or axis < -a.ndim - 1:
TypeError: '>' not supported between instances of 'list' and 'int'
dorarad commented 3 years ago

Hi, Thank you for the interest in the work! I have couple deadlines over the next days so will definitely try to get back to you by the end of the week!

chalure commented 3 years ago

How many GPUs are required to train at least ?

dorarad commented 2 years ago

Hi, most sincere apologies for not getting back to it earlier! The model can be trained by even a single GPU.

On which line of the code did you get the error? Did you make changes by any chance in the implementation? The error seems to potentially indicate some small bug so further information could be helpful.

Couple more points:

I hope one of these might resolve the issue!