pcclark4 / cggml

A general algorithms library written in ANSI C.

Change radix sort to be byte-based instead of bit-based #7

Closed pcclark4 closed 4 years ago

pcclark4 commented 4 years ago

This reduces the number of outer loop iterations from 32 to 4 (one pass per byte of a 32-bit key instead of one per bit), with only a mild increase in space. It should give a good speedup. However, I wonder whether the extra 253 bytes of storage could be a problem for embedded targets. Maybe we need a flag or a separate variant that keeps the bit-based strategy?
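For illustration, a minimal sketch of what a byte-based pass structure could look like. This is not the code from the repository: the name `sketch_radix_sort_bytes`, the caller-supplied `scratch` buffer, and the `size_t` counters are all assumptions, and the sketch assumes 8-bit bytes. Because the counters here are `size_t`, the table's footprint in this sketch will not match the figure quoted above.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define SKETCH_RADIX 256u /* one bucket per possible byte value */

/* Hypothetical helper, not cggml's API: LSD radix sort on unsigned keys,
 * one byte per pass. Assumes 8-bit bytes; scratch must hold n elements. */
void sketch_radix_sort_bytes(unsigned int *keys, unsigned int *scratch, size_t n)
{
    size_t count[SKETCH_RADIX];
    size_t pass, i, sum, c;
    unsigned int *src = keys, *dst = scratch, *swap;

    assert(keys != NULL && scratch != NULL);

    /* One outer iteration per byte of the key: 4 for a 32-bit unsigned int. */
    for (pass = 0; pass < sizeof(unsigned int); pass++) {
        unsigned int shift = (unsigned int)(pass * 8);

        /* Histogram of the current byte. */
        memset(count, 0, sizeof count);
        for (i = 0; i < n; i++)
            count[(src[i] >> shift) & 0xFFu]++;

        /* Exclusive prefix sum: count[b] becomes the first output index
         * for keys whose current byte equals b. */
        sum = 0;
        for (i = 0; i < SKETCH_RADIX; i++) {
            c = count[i];
            count[i] = sum;
            sum += c;
        }

        /* Stable scatter into the other buffer. */
        for (i = 0; i < n; i++)
            dst[count[(src[i] >> shift) & 0xFFu]++] = src[i];

        swap = src; src = dst; dst = swap;
    }

    /* After an odd number of passes the result sits in scratch. */
    if (src != keys)
        memcpy(keys, src, n * sizeof *keys);
}
```

A caller would pass the array to sort plus a second buffer of the same length, for example `sketch_radix_sort_bytes(a, tmp, n)`. The 256-entry count table is the space cost being weighed here; the scratch buffer is needed by either strategy in this sketch.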

pcclark4 commented 4 years ago

I decided to keep the bit-wise version around in case a smaller memory footprint is needed.
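For comparison, a rough sketch of how a bit-wise variant might look. Again, this is not the repository's implementation: `sketch_radix_sort_bits` and its `scratch` parameter are hypothetical, and the sketch assumes `unsigned int` has no padding bits. It needs only a single counter per pass, at the cost of one pass per bit of the key.

```c
#include <assert.h>
#include <limits.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not cggml's API: LSD radix sort on unsigned keys,
 * one bit per pass. scratch must hold n elements. */
void sketch_radix_sort_bits(unsigned int *keys, unsigned int *scratch, size_t n)
{
    size_t bits = sizeof(unsigned int) * CHAR_BIT; /* 32 on typical targets */
    size_t bit, i, zeros, zi, oi;
    unsigned int *src = keys, *dst = scratch, *swap;

    assert(keys != NULL && scratch != NULL);

    for (bit = 0; bit < bits; bit++) {
        /* Count keys whose current bit is 0; this is the only
         * bookkeeping a pass needs. */
        zeros = 0;
        for (i = 0; i < n; i++)
            if (((src[i] >> bit) & 1u) == 0)
                zeros++;

        /* Stable partition: zeros first, then ones. */
        zi = 0;
        oi = zeros;
        for (i = 0; i < n; i++) {
            if (((src[i] >> bit) & 1u) == 0)
                dst[zi++] = src[i];
            else
                dst[oi++] = src[i];
        }

        swap = src; src = dst; dst = swap;
    }

    /* After an odd number of passes the result sits in scratch. */
    if (src != keys)
        memcpy(keys, src, n * sizeof *keys);
}
```

The trade-off is the one discussed in this issue: this form walks the array once per bit rather than once per byte, but it avoids the per-byte count table.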