Closed: richarddwang closed this 3 years ago
@richarddwang Perhaps try setting CUDA_VISIBLE_DEVICES? Or do you still have an issue...
Thank you, @anshulsamar!
I set os.environ["CUDA_VISIBLE_DEVICES"] = "3"
and now it uses only cuda:3.
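For anyone else hitting this, here is a minimal sketch of the workaround described above. It assumes the variable is set at the very top of the script, before the deep learning framework initializes CUDA (setting it before the framework import is the safe choice). The helper `visible_gpu_ids` is hypothetical and only there to check what was set; it is not part of this repo or of any library.

```python
import os

# Restrict this process to physical GPU 3. Set this before importing the
# framework, so it is already in place when CUDA is initialized.
os.environ["CUDA_VISIBLE_DEVICES"] = "3"


def visible_gpu_ids():
    """Hypothetical helper: return the GPU ids listed in CUDA_VISIBLE_DEVICES."""
    value = os.environ.get("CUDA_VISIBLE_DEVICES", "")
    return [part.strip() for part in value.split(",") if part.strip()]


print(visible_gpu_ids())  # ['3']

# Import the framework only after this point.
```

Note that inside the process the selected card is renumbered, so the framework will typically report it as device 0 even though it is physical GPU 3.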
May I ask one more question? It now takes only 305MiB on one GPU (a GV100, as fast as a V100), but training is slow (1.1% after 6 hours). I am using the default config. Do you have any suggestions?
No problem, not sure about that one...
I have a 4-GPU server, and I found that even when I just load data using input_func from get_input_fn, it uses all 4 of my GPUs, taking 305MiB of memory on each.