Closed fhs closed 9 years ago
LGTM
Would you mean explaining a bit for my education? I tried googling but the discussions were pretty technical. I can see the affect of the change, and understand the differences. However, it seems like there are two instructions that are equally fast, where one only works on aligned data, and the other works on any data. Why would one want to use MOVAPD when you could use MOVUPD instead? Is there a theoretic speed penalty? Is it just that you didn't understand? I'm not trying to be accusatory, just trying to keep up on top of code development. Thanks for the changes.
Just a note, early merge because it fixes ddot.We should still consider the incremented test if appropropriate.
MODAPD is suppose to be faster than MOVUPD, but in practice, at least on the CPU I ran the benchmarks on, it doesn't make much difference. It general alignment is a good thing. See: https://groups.google.com/d/msg/golang-nuts/HFjwPFYrCqg/oSttxY8ajr4J
I'm going to leave a related note on that thread for future readers. Feel free to chime in if you have things to add.
Thanks for the fix.
No real change in the benchmark results (Go 1.4 on Intel(R) Xeon(R) CPU E5-2660 0 @ 2.20GHz):
Fixes #77