**jere357** opened 9 months ago:
I ran four experiments on our ada6k, training `vit_l_16` with different setups, to see how much this flag helps. The speedup seems to be significant only for `--precision 32` training.
| `float32_matmul_precision` | `--precision` | img/s |
|---|---|---|
| highest (default) | 32 | 47 |
| highest (default) | 16-mixed | 177 |
| high | 32 | 94 |
| high | 16-mixed | 176 |
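For context, here is a minimal sketch of the kind of run being compared, assuming torchvision's `vit_l_16` wrapped in a Lightning 2.x module (`LitViT` below is a hypothetical name) and that the `--precision` CLI flag maps to `Trainer(precision=...)`:

```python
import torch
import lightning as L

# The flag being benchmarked: "high" lets float32 matmuls use TF32
# Tensor Core kernels; "highest" (the default) keeps full fp32 precision.
torch.set_float32_matmul_precision("high")

trainer = L.Trainer(
    accelerator="gpu",
    devices=1,
    precision="32-true",  # or "16-mixed", matching the rows in the table
)
# trainer.fit(LitViT(), datamodule=...)  # LitViT is a hypothetical wrapper
```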
@jere357 Can you create a PR for this?
```
You are using a CUDA device ('NVIDIA GeForce RTX 3060') that has Tensor Cores. To properly utilize them, you should set
torch.set_float32_matmul_precision('medium' | 'high')
which will trade-off precision for performance. For more details, read https://pytorch.org/docs/stable/generated/torch.set_float32_matmul_precision.html#torch.set_float32_matmul_precision
```
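As a usage note, acting on that warning is a one-liner in plain PyTorch, independent of Lightning; a minimal sketch (the `get_float32_matmul_precision` call is only there to confirm the setting took effect):

```python
import torch

# Opt in once, early in the entry point, before any float32 matmuls run.
# "high" allows TF32 Tensor Core kernels; "medium" also allows bfloat16,
# trading more precision for speed (see the linked PyTorch docs).
torch.set_float32_matmul_precision("high")
print(torch.get_float32_matmul_precision())  # -> "high"
```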