mlcommons / training_policies

Issues related to MLPerf™ training policies, including rules and suggested changes
https://mlcommons.org/en/groups/training
Apache License 2.0
93 stars 66 forks source link

SGD & LARs for Resnet #336

Closed bitfort closed 4 years ago

bitfort commented 4 years ago

Proposal:

Submitters have invested in both LARS and SGD for this round and the rules were not very clear during this cycle. We seek to enable submitters who invested in either optimizer to successfully submit this round. We want to enable both SGD with polynomial learning rate schedules and LARS at lower batch sizes.

bitfort commented 4 years ago

This has been captured in the HP table.