Open vstakhov opened 9 years ago
Can you try the attached patch?
That helped, thank you.
bin/blake2b-util bench
time granularity: 24 cycles, 2195297384 cycles/second
1 byte(s):
avx2, 396.00 cycles per call, 396.0000 cycles/byte
avx, 333.00 cycles per call, 333.0000 cycles/byte
x86, 356.00 cycles per call, 356.0000 cycles/byte
generic/64, 586.00 cycles per call, 586.0000 cycles/byte
128 byte(s):
avx2, 389.00 cycles per call, 3.0391 cycles/byte
avx, 316.00 cycles per call, 2.4688 cycles/byte
x86, 353.00 cycles per call, 2.7578 cycles/byte
generic/64, 581.00 cycles per call, 4.5391 cycles/byte
576 byte(s):
avx2, 1416.00 cycles per call, 2.4583 cycles/byte
avx, 1474.00 cycles per call, 2.5590 cycles/byte
x86, 1648.00 cycles per call, 2.8611 cycles/byte
generic/64, 2450.00 cycles per call, 4.2535 cycles/byte
8192 byte(s):
avx2, 16426.00 cycles per call, 2.0051 cycles/byte
avx, 18352.00 cycles per call, 2.2402 cycles/byte
x86, 20888.00 cycles per call, 2.5498 cycles/byte
generic/64, 28548.00 cycles per call, 3.4849 cycles/byte
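For reading the figures above: each cycles/byte value is the cycles-per-call figure divided by the message size, and the reported clock rate lets you turn that into an approximate throughput. The short Python sketch below redoes that arithmetic for the 8192-byte avx2 row. The function names are hypothetical and not part of blake2b-util, and the MB/s conversion is a derived estimate rather than something the tool prints.

```python
# Hypothetical helpers for interpreting the benchmark lines; not part of blake2b-util.

def cycles_per_byte(cycles_per_call, message_bytes):
    """Cycles spent per input byte for one hashing call."""
    return cycles_per_call / message_bytes


def throughput_mb_per_s(cpb, cycles_per_second):
    """Approximate throughput in MB/s (1 MB = 10**6 bytes), assuming a constant clock."""
    return cycles_per_second / cpb / 1e6


# Values copied from the benchmark output: clock rate and the avx2 row at 8192 bytes.
CYCLES_PER_SECOND = 2195297384
avx2_cpb = cycles_per_byte(16426.00, 8192)
avx2_mbps = throughput_mb_per_s(avx2_cpb, CYCLES_PER_SECOND)

print(f"avx2 @ 8192 bytes: {avx2_cpb:.4f} cycles/byte")
print(f"avx2 @ 8192 bytes: roughly {avx2_mbps:.0f} MB/s")
```

The same division explains why the 1-byte rows look so expensive: the fixed per-call cost is charged to a single byte, so cycles/byte equals cycles per call there.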
Looks like #3 corrects this issue.
I tried to compile the code on an OS X system. However, compilation failed with the following error:
I tried to fix it by switching to %rip-relative addressing, applying a patch like this one: https://gist.github.com/vstakhov/37442eaf04ebfdd315e0
The code then compiled, but it caused a core dump.
Registers content:
Other extensions work fine after fuzz testing.