tromp / cuckoo

a memory-bound graph-theoretic proof-of-work system
Other
822 stars 173 forks source link

Add SSE2 variants for Siphash24 function #46

Closed OpticFlowX closed 6 years ago

OpticFlowX commented 6 years ago

Added siphash24x2and siphash24x4variants with SSE2 instruction optimization. Also replaced ROT16 in the SSE2 variant with shuffle instead of shift instructions.

There are two build examples, lean30x2and lean30x4in the Makefile, with a flag to disable AVX2.

Tested on the lean miner against verify30with nounces 63 and 140902 (on a single thread).