p12tic / libsimdpp

Portable header-only C++ low level SIMD library
Boost Software License 1.0
1.24k stars 129 forks source link

Add bitonic sort algorithm #173

Closed p12tic closed 5 months ago

p12tic commented 5 months ago

Currently only low-level register-based interface is exposed. Even the 16 lane sort is small enough and uses few enough registers on e.g. AVX2 that it makes sense to inline it and pass both input and gather output via SIMD registers without going to memory.

A higher-level, memory based interface can be exposed in the future.