mlcommons / training_policies

Issues related to MLPerf™ training policies, including rules and suggested changes
https://mlcommons.org/en/groups/training
Apache License 2.0

Epoch numbering from 1 #314

Closed: mwawrzos closed this 4 years ago

mwawrzos commented 4 years ago

According to the latest logging spec:

epoch_start Metadata: “epoch_num”: (epoch number starting from 1)

https://docs.google.com/document/d/1u15ouRbvd2xZkMEHNtmtg-yGzw0Zs6KVxQG_5WjTz7o/edit#
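As a rough illustration of what 1-based numbering means for a training loop, here is a minimal sketch. The record layout (`key`, `metadata`) and the helper name `epoch_start_events` are hypothetical and chosen only for this example; they are not the API of the MLPerf logging package. The only detail taken from the spec quoted above is that `epoch_num` starts from 1.

```python
import json


def epoch_start_events(num_epochs):
    """Build one epoch_start record per epoch, numbering epochs from 1.

    Hypothetical helper: the dict layout is an assumption for illustration,
    not the real MLPerf log format.
    """
    events = []
    # range(1, num_epochs + 1) yields 1..num_epochs, so the first epoch is 1, not 0.
    for epoch_num in range(1, num_epochs + 1):
        events.append({"key": "epoch_start", "metadata": {"epoch_num": epoch_num}})
    return events


if __name__ == "__main__":
    for event in epoch_start_events(3):
        print(json.dumps(event))
```

A loop written as `range(num_epochs)` would instead log the first epoch as 0, which is the off-by-one that this change is meant to rule out.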

googlebot commented 4 years ago

We found a Contributor License Agreement for you (the sender of this pull request), but were unable to find agreements for all the commit author(s) or Co-authors. If you authored these, maybe you used a different email address in the git commits than was used to sign the CLA? If these were authored by someone else, then they will need to sign a CLA as well, and confirm that they're okay with these being contributed to Google. In order to pass this check, please resolve this problem and then comment "@googlebot I fixed it." If the bot doesn't comment, it means it doesn't think anything has changed.

googlebot commented 4 years ago

CLAs look good, thanks!

matthew-frank commented 4 years ago

This PR is to resolve https://github.com/mlperf/training_policies/issues/132 which ends with @christ1ne saying:

The latest logging spec asks for epoch starting from 1. Please ignore the last update. @petermattson please update the rule doc if needed.

matthew-frank commented 4 years ago

And is also related to https://github.com/mlperf/training_policies/issues/160