Open chengjunlu opened 1 week ago
The SLPVectorizer + IGC_DisablePHIScalarizer improves the overall 3% performance on flash attention forward kernel.
Enable the SLPVectorizer before the IGCVectorizer could do the same transformation.
The SLPVectorizer + IGC_DisablePHIScalarizer improves the overall 3% performance on flash attention forward kernel.
Enable the SLPVectorizer before the IGCVectorizer could do the same transformation.