m4rs-mt / ILGPU

ILGPU JIT Compiler for high-performance .Net GPU programs
http://www.ilgpu.net

SSE/ARM Neon Hardware acceleration for Velocity backend #1163

Closed m4rs-mt closed 6 months ago

m4rs-mt commented 9 months ago

This PR adds native hardware acceleration for 128-bit vectors to the Velocity backend. The resulting backend is referred to as Velocity128. It uses SSE on x64 and Neon on Arm64.
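For readers unfamiliar with 128-bit lane operations, here is a minimal sketch of the general idea using the standard .NET `System.Runtime.Intrinsics` APIs. It is not code from this PR and does not reflect how Velocity128 is actually implemented; the `Vector128Sketch` class and its `Add` method are hypothetical names chosen for illustration. It assumes .NET 5 or later.

```csharp
using System;
using System.Runtime.Intrinsics;
using System.Runtime.Intrinsics.Arm;
using System.Runtime.Intrinsics.X86;

// Hypothetical helper for illustration only; not part of ILGPU.
public static class Vector128Sketch
{
    // Adds four packed floats, using a hardware instruction where one is available.
    public static Vector128<float> Add(Vector128<float> left, Vector128<float> right)
    {
        if (Sse.IsSupported)
        {
            // x86/x64 path: SSE packed single-precision add.
            return Sse.Add(left, right);
        }

        if (AdvSimd.IsSupported)
        {
            // Arm path: Neon (Advanced SIMD) vector add.
            return AdvSimd.Add(left, right);
        }

        // Scalar fallback for platforms without either instruction set.
        return Vector128.Create(
            left.GetElement(0) + right.GetElement(0),
            left.GetElement(1) + right.GetElement(1),
            left.GetElement(2) + right.GetElement(2),
            left.GetElement(3) + right.GetElement(3));
    }
}
```

Calling `Vector128Sketch.Add(Vector128.Create(1f, 2f, 3f, 4f), Vector128.Create(10f, 20f, 30f, 40f))` yields the lanes 11, 22, 33, 44 on every path. The `IsSupported` checks are resolved by the JIT, so only the branch matching the current platform remains in the generated code.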