Open GoogleCodeExporter opened 9 years ago
maybe it has something to do with data alignment? so only use the fast move if
(both?) pointers are 8byte/16byte aligned?
(@s[11] AND 7 = 0) 8byte
(@s[11] AND 15 = 0) 16byte aligned
Original comment by andre.mussche
on 8 Apr 2014 at 7:40
When I move from 2 to 1 it's the slowest (10.8 sec).
From 2 >> 1 to 32 >> 1 it decreases to 8.4 sec. At 33 >> 1 it switches to 3.1
sec.
Original comment by david.br...@gmail.com
on 8 Apr 2014 at 8:18
I have tested on another computer with an AMD processor (SSE3).
The problem is the same, meaning it's a very big chance to appear on ANY
computer with AMD processor.
Original comment by david.br...@gmail.com
on 10 Apr 2014 at 6:22
Original issue reported on code.google.com by
david.br...@gmail.com
on 8 Apr 2014 at 7:29