Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
Apache License 2.0
1.07k
stars
60
forks
source link
cuDNN executor: No valid engine configs for MUL_Reduction_MUL_Matmul_MUL_ADD_SUB_EXP_Reshape_Matmul_Matmul_MUL_SUB_MUL_Reduction_MUL_Reshape_Matmul_Reshape_Matmul_ #625
A quantization test that I want to merge does not work with the cuDNN executor: