JuliaLinearAlgebra / Octavian.jl

Multi-threaded BLAS-like library that provides pure Julia matrix multiplication
https://julialinearalgebra.github.io/Octavian.jl/stable/
Other
230 stars 18 forks source link

vfmaddsub complex matmul #140

Open chriselrod opened 2 years ago

chriselrod commented 2 years ago

If anyone wants to give it a go, see here for an example: https://github.com/JuliaSIMD/LoopVectorization.jl/blob/ed466fb1ca7e92b70b98d6ee50eb5544b64678e1/test/shuffleloadstores.jl#L143-L157

I'd be happy to answer any questions.

This requires the FMA instruction set, meaning x86_64 only. Thus, using these should be behind a check for this.