Firstly, i use m=1024 n=1024 k=3072 to tune SGEMM kernel, i get a best configuration.
But, when i use m=2048 n=2048 k=2048 to tune SGEMM kernel, i find the best configuration that i got by m=1024 n=1024 k=3072 encounters l2 error.
So, can i use the best configuration that i got by m=1024 n=1024 k=3072 to be the default parameter in xgemm_32.hpp?
I afraid this configuration may cause some computing errors in other matrix shapes like m=2048 n=2048 k=2048.
In theory yes, but, the safest would be to actually run a correctness test in that case. Or choose one of the other tuning parameters: typically a lot of them are close to the optimal value.
Firstly, i use m=1024 n=1024 k=3072 to tune SGEMM kernel, i get a best configuration. But, when i use m=2048 n=2048 k=2048 to tune SGEMM kernel, i find the best configuration that i got by m=1024 n=1024 k=3072 encounters l2 error. So, can i use the best configuration that i got by m=1024 n=1024 k=3072 to be the default parameter in xgemm_32.hpp? I afraid this configuration may cause some computing errors in other matrix shapes like m=2048 n=2048 k=2048.