PaddlePaddle / Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning & machine learning)
http://www.paddlepaddle.org/
Apache License 2.0

Optimizer Design and related Operator #3655

Closed jacquesqiao closed 7 years ago

jacquesqiao commented 7 years ago
  1. Implement various optimizer operators.
    1. SGD (done) and others listed in https://github.com/PaddlePaddle/Paddle/projects/22
  2. Add optimizer operators into BlockDesc.
  3. Python module
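For reference, the SGD operator in step 1 computes the standard update rule `param_out = param - learning_rate * grad`. A minimal NumPy sketch of that computation (illustrative only; the function name `sgd_op` is hypothetical, not Paddle's actual kernel):

```python
import numpy as np

def sgd_op(param, grad, learning_rate):
    # Vanilla SGD update: param_out = param - learning_rate * grad
    return param - learning_rate * grad

w = np.array([1.0, 2.0])
g = np.array([0.5, 0.5])
w_new = sgd_op(w, g, learning_rate=0.1)
# w_new is [0.95, 1.95]
```

The other optimizers tracked in the linked project board (e.g. momentum, Adam) follow the same operator pattern with additional state inputs.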

In the new operator-based framework, optimizers will be built from operators, and these operators will be added to a block. There is some work to do:

There are three situations to consider:

  1. single machine, single device.
  2. single machine, multiple devices.
  3. multiple machines.
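In the multi-device and multi-machine cases, the usual approach (as in data-parallel training) is to average the gradients computed on each device before applying a single optimizer update. A hedged sketch of that idea, with a hypothetical helper name:

```python
import numpy as np

def averaged_sgd_step(param, per_device_grads, learning_rate):
    # Data-parallel SGD: average the gradients from all devices,
    # then apply one SGD update to the shared parameters.
    mean_grad = np.mean(per_device_grads, axis=0)
    return param - learning_rate * mean_grad

w = np.array([1.0])
grads = [np.array([0.2]), np.array([0.4])]  # gradients from two devices
w_new = averaged_sgd_step(w, grads, learning_rate=0.1)
# mean gradient is 0.3, so w_new is [0.97]
```

The single-device case reduces to the plain SGD update; the multi-machine case additionally needs communication to exchange gradients, which is covered by the distribution-related issue linked below.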

Plan:

Distribution-related issue: https://github.com/PaddlePaddle/Paddle/issues/3656

jacquesqiao commented 7 years ago

For TensorFlow: https://github.com/tensorflow/models/blob/master/tutorials/image/cifar10/cifar10_multi_gpu_train.py https://www.tensorflow.org/tutorials/deep_cnn

jacquesqiao commented 7 years ago

Closing this old issue and opening a new one: https://github.com/PaddlePaddle/Paddle/issues/4679