dmlc / keras

Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on MXNet, Theano or TensorFlow.
http://keras.io/

Working multi-GPU ResNet-50 on ImageNet with MXNet backend #74

Closed mkolod closed 5 years ago

mkolod commented 7 years ago

This PR provides a performant example of training a fairly large model (ResNet-50) on a large dataset (ImageNet). It demonstrates the MXNet backend's performance on a single GPU and its scaling to multiple GPUs (tested on up to 8 P100s).

I'm sure some things will need to be iterated on, especially the copy of data.py taken from MXNet. That file lives in mxnet/example/image-classification/common, whereas the importable MXNet Python package is rooted at mxnet/python/mxnet, so data.py cannot be imported from keras/examples and had to be copied over. I'll make the necessary changes as I get feedback, of course.
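For reviewers weighing alternatives to copying data.py, here is a minimal sketch of one option: putting the MXNet example directory on sys.path at runtime. It is not what this PR does. It assumes a local MXNet source checkout is available, and both the helper name add_example_common_to_path and its mxnet_src_root argument are hypothetical.

```python
import os
import sys


def add_example_common_to_path(mxnet_src_root):
    """Make mxnet/example/image-classification/common importable.

    mxnet_src_root is a hypothetical path to a local MXNet source checkout
    (the directory that contains example/ and python/).
    Returns the directory added to sys.path, or None if data.py is not there.
    """
    common_dir = os.path.join(
        mxnet_src_root, "example", "image-classification", "common"
    )
    # Only touch sys.path when the expected file is really present.
    if not os.path.isfile(os.path.join(common_dir, "data.py")):
        return None
    if common_dir not in sys.path:
        sys.path.insert(0, common_dir)
    return common_dir
```

If the helper returns a directory, a following `import data` would resolve to the file in the checkout. The trade-off is that the example then depends on a source tree being present, which a pip-installed MXNet does not guarantee; that is presumably why copying the file is simpler for now.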