AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Apache License 2.0
4.54k
stars
363
forks
source link
Del using namespace aitemplate/AITemplate/static/include/kernels/mem_eff_attention/kernel_forward.h #1014
Summary:
Removes a using namespace from the global namespace in pursuit of enabling -Wheader-hygiene.
Qualifies instances that relied on the using namespace.
Summary: Removes a
using namespace
from the global namespace in pursuit of enabling-Wheader-hygiene
. Qualifies instances that relied on theusing namespace
.Differential Revision: D60066235