darglein / ADOP

MIT License
2.02k stars 197 forks source link

Multi-gpu support #64

Open parrot1166 opened 2 years ago

parrot1166 commented 2 years ago

Hello, is there a way to run training on multiple gpus? have a machine with 4 gpus and was wondering if there was a config option(or plans) to enable multi-gpu support.

Thanks!

darglein commented 2 years ago

Sorry this is currently not supported. You can however train multi instances on different GPUs, for example, to try out various settings.

RobotBytedance commented 1 year ago

Sorry this is currently not supported. You can however train multi instances on different GPUs, for example, to try out various settings.

I have try to set the traning on different GPU like this:

image

But, it still shows error and exits at training.

image

and the GPU shows that it have memory can work.

image

Is there something wrong?What else should I do?

darglein commented 1 year ago

To train on a specific GPU you can use the CUDA_VISIBLE_DEVICES environment variable. Don't change the global "device" variable for that.

RobotBytedance commented 1 year ago

To train on a specific GPU you can use the CUDA_VISIBLE_DEVICES environment variable. Don't change the global "device" variable for that.

Sorry for trouble you again. Where should I set the CUDA_VISIBLE_DEVICES?Could you please give me some resources or demo about that ?