The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
281
stars
15
forks
source link
Jit assembler #7
Closed
mratsim closed 5 years ago