facebookresearch / FAMBench

Benchmarks to capture important workloads.
Apache License 2.0
28 stars 23 forks source link

TF32 changes #87

Closed nrsatish closed 2 years ago

nrsatish commented 2 years ago

This is to ensure GPU runs can be done in TF32 precision. Adds flags to linear and gemm ubenches as well as DLRM OOTB bench.