Closed: BichengYing closed this issue 4 years ago
As a temporary workaround, we set the BLUEFOG_WIN_ON_CPU=1 flag so that Bluefog copies the GPU tensor to the CPU, communicates through the CPU, and copies the result back to the GPU once communication is done.
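For anyone who wants to apply the flag from a Python launcher script instead of the shell, here is a minimal sketch. The helper name env_with_cpu_win_ops is made up for illustration, and it is an assumption that the variable has to be in the environment before Bluefog is initialized; only the BLUEFOG_WIN_ON_CPU=1 setting itself comes from this thread.

```python
import os


def env_with_cpu_win_ops(base_env=None):
    """Return a copy of an environment mapping with BLUEFOG_WIN_ON_CPU=1 set.

    Hypothetical helper: the name is not part of Bluefog. Only the
    variable name and its value are taken from the comment above.
    """
    # Copy so the caller's mapping (or os.environ) is not modified.
    env = dict(os.environ if base_env is None else base_env)
    env["BLUEFOG_WIN_ON_CPU"] = "1"
    return env


# Example: build the environment for a child process that runs the training job.
child_env = env_with_cpu_win_ops()
print(child_env["BLUEFOG_WIN_ON_CPU"])
```

The resulting mapping can be passed as the env argument of subprocess.run when starting the job, so the flag only affects that run.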
Problem solved. Docker has to be run in privileged mode; adding the "--privileged" flag is all that is needed.
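As a rough sketch of where the flag goes, the snippet below assembles a docker run command with "--privileged" placed before the image name. The image name my-bluefog-image and the train.py command are placeholders, not names from this issue, and the function privileged_docker_cmd is hypothetical.

```python
import shlex


def privileged_docker_cmd(image, command):
    """Build a `docker run` argument list that enables privileged mode.

    Docker options such as --privileged must come before the image name;
    everything after the image is the command run inside the container.
    """
    return ["docker", "run", "--privileged", image, *command]


# Placeholder image and command, for illustration only.
cmd = privileged_docker_cmd("my-bluefog-image", ["python", "train.py"])
print(shlex.join(cmd))
```

The list form can be handed directly to subprocess.run; the printed string is the equivalent shell command.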
Also, win-ops communication with Open MPI 4.0 is not supported yet.