Closed Z-Zheng closed 5 years ago
Pass all model parameters to the optimizer, rather than only the trainable ones, so that checkpoint operations work uniformly
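The change described above can be illustrated with a minimal sketch. It assumes a PyTorch-style setup, which the text itself does not name. The model, layer sizes, and variable names are hypothetical and are not taken from the actual change. The sketch compares an optimizer built from trainable parameters only with one built from all parameters, and shows that the two produce optimizer checkpoints with different layouts.

```python
import torch
from torch import nn

# Hypothetical two-layer model; the first layer stands in for a frozen backbone.
model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
for p in model[0].parameters():
    p.requires_grad = False  # freeze the first layer

# Option A: trainable parameters only. The optimizer's parameter list, and so
# the layout of its state_dict, depends on which layers happen to be frozen.
trainable_only = torch.optim.SGD(
    [p for p in model.parameters() if p.requires_grad], lr=0.1, momentum=0.9
)

# Option B: all parameters. The state_dict layout follows the model alone.
all_params = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)

n_trainable = len(trainable_only.state_dict()["param_groups"][0]["params"])
n_all = len(all_params.state_dict()["param_groups"][0]["params"])

# One step with option B: frozen parameters have no gradient, so step() skips them.
frozen_before = model[0].weight.clone()
loss = model(torch.randn(3, 4)).sum()
loss.backward()
all_params.step()
frozen_unchanged = torch.equal(frozen_before, model[0].weight)

# A checkpoint written by option B cannot be loaded into option A, because the
# parameter group sizes differ.
try:
    trainable_only.load_state_dict(all_params.state_dict())
    mismatch = False
except ValueError:
    mismatch = True
```

With every parameter registered, the optimizer checkpoint keeps the same layout whichever layers are frozen, so the same save and load code can serve different freezing configurations. One caveat under the same assumption: the optimizer skips a parameter only when its gradient is `None`, so a parameter frozen midway through training may need its stale gradient cleared first.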