google-research / robustness_metrics

Apache License 2.0
464 stars 33 forks source link

Stabilize runtime measurement for predictions #5

Open dustinvtran opened 3 years ago

dustinvtran commented 3 years ago

Including time_in_ms in outputted predictions works in principle. However, there's too much variability across datacenters that the value is unreliable. To measure this reliably, we need to do a more controlled experiment, fixing the accelerator and measuring model runtimes on that same accelerator. (See Uncertainty Baselines' internal profile Colab notebooks.)