Open idoleat opened 2 months ago
Considering how each micro architecture differs on atomic insn implementation
I think this should be a separated subsection since it could be a long story on how compilers and different architectures ensure how an operation is logically atomic.
An overview of the subsection could be like
The benchmark result could thus reflect the performance and scalability on each implementation.
Benchmark atomic instructions latency and data through put
We provide a benchmark program to
Considering how each micro architecture differs on atomic insn implementation, core topology and cache coherence protocol, the benchmark should run on a wide variety of hardware platforms. May be we can invite volunteers to benchmark.
I am looking into Evaluating the Cost of Atomic Operations on Modern Architectures and its citations, examining how we can conduct the benchmark.
False sharing example
We provide an example showing how false sharing affect performance. Benchmark provided by Zeosleus or examples in previous sections could be used.
Before working on this proposal, I will add example for discussing ABA problem in section 6 first and HTML export as well.