This introduces compile-time tweak environment variables like this:
MATMUL_DGEMM_NC
MATMUL_DGEMM_MC
MATMUL_DGEMM_KC
etc for each gemm. These allow setting these size parameters at
compile time - they should ideally be optimized per kernel and
microarch.
Combine these parameters with the benchmark in ./examples/benchmark.rs
and its csv output option - this allows optimizing performance depending
on these parameters. This was already useful for complex.
This introduces compile-time tweak environment variables like this:
etc for each gemm. These allow setting these size parameters at compile time - they should ideally be optimized per kernel and microarch.
Combine these parameters with the benchmark in ./examples/benchmark.rs and its csv output option - this allows optimizing performance depending on these parameters. This was already useful for complex.
Using DutchGhost's const parsing code from
https://gist.github.com/DutchGhost/d8604a3c796479777fe9f5e25d855cfd
which has been very useful.
Co-authored-by: DutchGhost DutchGhost@users.noreply.github.com