Formal memory model - Githubissues

jfbastien commented 8 years ago

TL;DR: IMO getting formal memory model work started (not necessarily finished) should block moving the spec to Stage 3. I like SAB, but I don't trust English when we could have Maths.

A formal memory model akin to the work that was done for C++ would be useful in ensuring the SAB memory model is solid. Doing so found a bunch of issues for C++. This issue is related to #22.

The SAB model isn't quite like C++'s in that it's:

Based on memory locations and not objects and their lifetime.
Allows mixing accesses of different types sizes (but does require alignment).
- But doesn't have type-based aliasing rules.
- Doesn't have memcpy / memmove / memset (which are a trouble with the C11 spec).
Allows mixing non-atomic accesses and atomic accesses (see this paper and #13).
- This has extra issues w.r.t. non-lock-free operations, which would typically go through a lock shard provided by the VM.
Only supports seq_cst for now (but I'd like the SAB model to also support acquire / release as in #15 to show it'll work).
Doesn't support fence as in #25 since they aren't needed when only seq_cst is available.
Specifies futex (C++ puts mutex in the library), and maybe micro-wait (see #87).
Doesn't deal with signals and setjmp/longjmp at the moment, and doesn't have signal_fence.
Has to lower to different types of hardware (x86, ARM, A64, MIPS, POWER are all likely targets).
Tries to specify what happens when there are races (see #37, #48, #51, #71, and #82).
There can be multiple SABs, which would be similar in C++ to having multiple disjoint "memories".
There's another realm of JavaScript objects outside of the SABs, as well as an event loop and Web APIs, which could observe ordering of SAB operations indirectly.
SAB will likely interact with WebAssembly's own atomics (detailed proposal), similar to intra-process shared memory but without C++11's volatile escape hatch.
- This will be even more complicated if both don't have exactly the same memory model.

In that sense it's closer to what a hardware memory model looks like.

Specifically, I'd like something similar to Jade's thesis or Mark's thesis. Background history in these slides, it seems like SAT or SMT are ideally suited for this purpose.

Without this model the best case is that the English spec happens-to-work, but the worst case is that we move to Stage 3, browser ship without a flag, devs rely on things which we have to relax and browsers can't / won't break them. That would be unfortunate, but not the end of the world: witness Java, pre-C11 C, pthread, Linux, etc all having broken memory models and still working. Having a formal memory model is one of the rare cases in CS where Maths can be used to show tricky things work, I think it would be silly to ignore the last few years' advance in this field.

Having a formal model can also help figure out which optimizations are now invalid in implementations, e.g. 1 and 2, but I think this is just a nice side-effect of having a formal model in the first place.

littledan commented 8 years ago

EDIT: Moved to #91.

jfbastien commented 8 years ago

EDIT: Moved to #91.

littledan commented 8 years ago

EDIT: Moved to #91.

jfbastien commented 8 years ago

EDIT: Moved to #91.