ROCm / AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Apache License 2.0

No 6000 Series Support? #82

Open jtdorseyiii opened 2 months ago

jtdorseyiii commented 2 months ago

I can't find any branch that supports 6000-series cards. I have a 6900XT, which is more than capable of handling this workload, but I can't find anything to suggest it was ever supported.

Would it be possible to merge in support, or could you point me in the right direction to get this working on the 6900XT?

Thank you!