data61 / cuda-fixnum

Extended-precision modular arithmetic library that targets CUDA.
Other
41 stars 28 forks source link

Specialise multi_modexp to case where all exponents are small #25

Open unzvfu opened 6 years ago

unzvfu commented 6 years ago

There is an implementation of this in the attic that definitely performs better than the full version. It would be nicer still to have a single version that works well in both cases; this might be achieved by calculating max(bitlen(exp[i])) over all the exponents and starting the main loop from there.

unzvfu commented 4 years ago

Follow up at https://github.com/unzvfu/cuda-fixnum/issues/11.