Open gfursin opened 5 years ago
As per these measurements, P3 sped up the data parallel training throughput of ResNet-50 by 15%. This particular experiment uses 25 Gbps network. In our experience 25 Gbps is sufficient for linearly scaling ResNet-50 as it being relatively small model. This is the reason why P3 is not able to provide significant speedup over baseline. In the controlled experiment we conducted, we were getting a peak speed up of 25% for ResNet-50.
That sounds good! Thanks!
Setup is similar to #1 : 1 master machine and 4 nodes with GPU Nvidia GTX 1080 Ti and Intel Ethernet Controller XXV710 for 25GbE SFP28 (Lille nodes).
Results from P3:
Complete log: experiment-resnet-github.log
Baseline results:
Complete log: experiment-resnet-baseline.log