unzvfu / cuda-fixnum

Extended-precision modular arithmetic library that targets CUDA.
MIT License
34 stars 8 forks source link

Implement Wallace tree multiplier #45

Open unzvfu opened 4 years ago

unzvfu commented 4 years ago

Reduces the number of carry-reduction layers from O(n) to O(log(n)).

See: https://en.wikipedia.org/wiki/Wallace_tree