Open georgh opened 6 years ago
Main remaining question would be, how you use the cpu during the gpu computation. Do you split with multiprocessing or is there an easier way?
@georgh can you send this as a PR ?
Main remaining question would be, how you use the cpu during the gpu computation.
The gpu ops are asynchronous. You can other stuff on the cpu as long as you dont run any synchronizing functions (af.sync or any function that copies memory back to the cpu).
I think it would be great to have an example for using multiple GPUs.
Here is what I tried. If thats the right way to do it, then you may add it as an example. It seems to scale fine (tested up to 7 GPUs) and nvidia-smi reports 96% util.