Open yaoqr15 opened 4 years ago
I'm not sure, but the hardware prefetchers may be getting confused at that point. If your application really does this in practice, I would measure the hit rates at each cache level and then experiment with adding software prefetch instructions or reorganizing the data into smaller sub-blocks.
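To make the sub-block and software prefetch idea concrete, here is a rough sketch in C. It assumes a simplified workload (XOR-ing k source buffers into one parity buffer), which may not match what your code actually does. The function name `xor_parity_blocked` and the constants `SUB_BLOCK_BYTES` and `CACHE_LINE_BYTES` are hypothetical and would need tuning on your machine. `__builtin_prefetch` is a GCC/Clang builtin and is only a hint to the CPU.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical tuning knobs, not taken from this thread. */
#define SUB_BLOCK_BYTES  4096  /* bytes of each source handled per pass */
#define CACHE_LINE_BYTES 64    /* assumed cache line size */

/*
 * XOR k source buffers of len bytes each into parity, one sub-block at a
 * time, so that the parity sub-block stays hot in cache while all k
 * sources are streamed through it.
 */
static void xor_parity_blocked(const uint8_t *const *src, size_t k,
                               uint8_t *parity, size_t len)
{
    assert(src != NULL && parity != NULL);

    for (size_t off = 0; off < len; off += SUB_BLOCK_BYTES) {
        /* The last sub-block may be shorter than SUB_BLOCK_BYTES. */
        size_t n = (len - off < SUB_BLOCK_BYTES) ? len - off : SUB_BLOCK_BYTES;

        memset(parity + off, 0, n);

        for (size_t s = 0; s < k; s++) {
            const uint8_t *cur = src[s] + off;

            /* Hint the same sub-block of the next source before using it. */
            if (s + 1 < k) {
                const uint8_t *next = src[s + 1] + off;
                for (size_t b = 0; b < n; b += CACHE_LINE_BYTES)
                    __builtin_prefetch(next + b, 0 /* read */, 3 /* keep in cache */);
            }

            for (size_t i = 0; i < n; i++)
                parity[off + i] ^= cur[i];
        }
    }
}
```

Whether this helps at all depends on the real access pattern, so I would compare timings and cache hit rates with and without the prefetch loop, and for a few different sub-block sizes, before keeping it.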
Thanks for your reply! Before asking, I had guessed that this behavior might be related to the CPU cache. Your suggestion is very helpful.
My setup: 64 MiB chunk size, tested from 32+4 to 35+4. CPU: Intel(R) Xeon(R) Silver 4110; memory: 256 GiB; OS: Ubuntu 16.04. The results are as follows:
Times are measured in ms. Does anybody know why the time jumps so sharply from k=34 to k=35? Thanks!