Open ADAM-CT opened 4 years ago
I'm not completely sure -- I think it was just the default value.
The default value is only one value (1000000000), only suitable for single server.I think our experimental environment is very similar, otherwise the performance will not be improved so much.
First of all, I want to say that your work is amazing. I used you in vgg16. gpus = 16. hybrid_conf.json. (https://github.com/msr-fiddle/pipedream/blob/f50827f2e28cbdbd82a4ea686c0498272b1460d6/runtime/image_classification/models/vgg16/gpus%3D16/hybrid_conf.json)
It only takes 600 seconds to train an epoch in the Imagenet dataset. Can you tell me how you generated this configuration file? Can you tell me the bandwidth parameter?