Closed fchen7i closed 7 years ago
While I don't know the exact reason, I can only speculate. This might be to do with memory bus width. Spec says, it has 128 bit wide bus (aka 4 floats). I guess when it loads float8/float16, it is not fitting the cache line & trashing heavily
Thanks. I will close the issue.
Hello,
I have one question about global memory bandwidth. I find that global memory bandwidth may decrease for float8 and float16 in most of devices. I hope to know the reason why global memory bandwidth decreases. The following is my log from my MacPro.
Thanks a lot.