Opening a pull request to showcase the movnt change I made, and if you guys can point out some issue. I don't see much speedup after changing to movnt, although I should.
This pull request has 2 changes:
Implementation of Log2() method, which is the faster Logging method for undoTx.
Change Log2() to use movnt.
There are no other changes that we have been discussing in the recent past. Even only with movnt() we should see speedups.
Assembly obtained from objdump of Intel's movnt C intrinsics:
Opening a pull request to showcase the movnt change I made, and if you guys can point out some issue. I don't see much speedup after changing to movnt, although I should.
This pull request has 2 changes:
There are no other changes that we have been discussing in the recent past. Even only with movnt() we should see speedups.
Assembly obtained from objdump of Intel's movnt C intrinsics:
Objdump for these are: