NVIDIA / modulus

Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
https://developer.nvidia.com/modulus
Apache License 2.0
836 stars 190 forks source link

Add tensor core support to CorrDiff training pipeline #402

Closed akshaysubr closed 1 month ago

akshaysubr commented 4 months ago

Modulus Pull Request

Description

This PR adds support for TF32 based matrix multiplies to speed up training by ~3.2x

Checklist

Dependencies

None