aszepieniec opened 3 months ago
It seems that `par_interpolate` can be made faster for some domain sizes, e.g. $2^{10}$, if evaluation uses "naive parallelism", i.e.:
```rust
impl<FF: FiniteField> Polynomial<FF> {
    fn parallel_naive_evaluate(&self, domain: &[FF]) -> Vec<FF> {
        // `collect_vec` is an itertools method on sequential iterators;
        // rayon's ParallelIterator uses plain `collect`.
        domain.par_iter().map(|x| self.evaluate(x)).collect()
    }
}
```
See the commit message of `fd5add7860eacba2b564f7a89a54c648b1475079` for more info.
The function `par_interpolate` has weird behavior for small domain sizes. In particular, it is faster when (some of) its subroutines are sequential.

There is a lot of potential for optimization here. In general it is okay to rely on dispatcher methods that choose the asymptotically or concretely superior algorithm depending on some threshold, but in the context of parallel hardware we ideally want hardcoded thresholds to be independent of the number of cores/threads. It is allowable to call `available_parallelism` and make a decision based on that. This task involves finding the optimal cascade of specialized functions and the optimal dispatch criteria.
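To make the shape of such a dispatcher concrete, here is a minimal, dependency-free sketch. It uses a toy polynomial over `u64` modular arithmetic instead of the crate's `FiniteField` types, `std::thread::scope` instead of rayon, and a made-up per-thread workload constant; every name and the threshold itself are illustrative assumptions, not the crate's API, and the real cutoff would have to come from benchmarks.

```rust
use std::thread;

// Horner evaluation of one polynomial at one point (toy field: Z mod p).
fn horner(coefficients: &[u64], x: u64, modulus: u64) -> u64 {
    coefficients.iter().rev().fold(0u64, |acc, &c| {
        // Widen to u128 so the multiply-add cannot overflow before reduction.
        ((acc as u128 * x as u128 + c as u128) % modulus as u128) as u64
    })
}

fn sequential_evaluate(coefficients: &[u64], domain: &[u64], modulus: u64) -> Vec<u64> {
    domain.iter().map(|&x| horner(coefficients, x, modulus)).collect()
}

// "Naive parallelism": split the domain into one chunk per thread and
// evaluate each chunk independently with the sequential routine.
fn parallel_naive_evaluate(
    coefficients: &[u64],
    domain: &[u64],
    modulus: u64,
    threads: usize,
) -> Vec<u64> {
    let chunk = ((domain.len() + threads - 1) / threads).max(1);
    thread::scope(|s| {
        let handles: Vec<_> = domain
            .chunks(chunk)
            .map(|part| s.spawn(move || sequential_evaluate(coefficients, part, modulus)))
            .collect();
        handles.into_iter().flat_map(|h| h.join().unwrap()).collect()
    })
}

// Hypothetical dispatcher: the threshold scales with the thread count
// reported at runtime, so the hardcoded constant stays core-independent.
fn evaluate_dispatch(coefficients: &[u64], domain: &[u64], modulus: u64) -> Vec<u64> {
    let threads = thread::available_parallelism().map(|n| n.get()).unwrap_or(1);
    // Assumed minimum number of points per thread for spawning to pay off.
    const MIN_POINTS_PER_THREAD: usize = 64;
    if domain.len() < threads * MIN_POINTS_PER_THREAD {
        sequential_evaluate(coefficients, domain, modulus)
    } else {
        parallel_naive_evaluate(coefficients, domain, modulus, threads)
    }
}
```

The point of the sketch is the shape of the cutoff test: the per-thread constant is fixed, and only the multiplier comes from `available_parallelism`, so the same source compiles to sensible dispatch on any machine.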