mlcommons / training_policies

Issues related to MLPerf™ training policies, including rules and suggested changes
https://mlcommons.org/en/groups/training
Apache License 2.0
92 stars 66 forks source link

Clarify definition of app to prevent cache tricks #31

Closed petermattson closed 6 years ago

petermattson commented 6 years ago

Metric is defined as end to end execution of application. Is there a clever way to preload outside of this definition?

If the application is interpreted to be the python script, then that would exclude things like scp-ing data from GCS/S3 and/or running a block-cache pre-warmer to load the dataset into the Linux block cache (and/or taking the second training run because the first one was cancelled mid-way through / etc). Maybe it makes sense to provide guidance on the interpretation of application? What do you think?

bitfort commented 6 years ago