wichtounet / etl

Blazing-fast Expression Templates Library (ETL) with GPU support, in C++
MIT License
216 stars 17 forks source link

Add Tensor Core support for convolutions #5

Closed ghost closed 4 years ago

ghost commented 4 years ago

This PR activates Tensor Cores for all convolution types in ETL. Activating this on GPUs that have no Tensor Cores results in worse performance (at least with a NVIDIA GTX 1650), so this should be activated on-demand. Though Tensor Cores appear to work with these changes, the performance remains the same as with them disabled (on a NVIDIA V100 GPU), so more testing is needed.