tc39 / proposal-signals

A proposal to add signals to JavaScript.
MIT License
2.87k stars 54 forks source link

Reasoning behind the push-then-pull model #196

Open samholmes opened 3 weeks ago

samholmes commented 3 weeks ago

I’d like to better understand the reasoning behind the push-then-pull model specified in this proposal. I’m looking for strong justification for it besides it being novel.

From my understanding, signals seem to be a push based system much like observables. The benefit of signals over observables would be their ergonomics and resource management. That would be my understanding, but I am open to being corrected in this understanding.

My understanding of the push-then-pull model is that dirty flags are pushed first through the graph, then only signals which are attached to some watcher endpoints (effects) are then pulling the graph where it’s dirty. The pulling starts at the beginning of the graph and the state changes propagate down the dirty paths until either reaching the end of the graph or a node where the state hasn’t changed (a === comparison).

If this understanding is correct, it seems to assuming that some optimization is to be had by eagerly propagating cache invalidation and then lazily propagating state changes. This would be an optimization assuming that portions of the graph may be frequently idle (not watched) only to become active when an effect is “mounted” which activates those portions of the graph.

From this, I can conclude that this proposal of signals is to suggest that signals are to be a state management system where the application state control flow is to be held in memory at all times, but the propagation of the state passing through the control flow is to be inactive until actively enabled by some watcher (effects). If this is a valid assessment, then my curiosity is why not leverage the language for this design: control flow can exist as function references in memory, and state only exists at the endpoints. Allow me to elaborate…

If a portion of the graph is not active, because no endpoints are watching this portion of the graph, then why hold reference to this sub-graph in memory at all? If all in-memory references to the instances are of the signals in this subgraph are not held, then the GC should clean them up. However, I could imagine that the references to them are held in memory while waiting for some event to mount a watcher or effect to activate them, but then the question is why track cache invalidation for them with a dirty flag? If they were in-active before the effect/watcher is mounted, then they are dirty because they haven’t been observed yet. So, we only seem to need to track whether the signal is being observed/pulled by some endpoint in the graph. If it is not, then sources shouldn’t propagate changes to them (nor dirty flags). If a signal is actively contacted to some endpoint then sources will push the changes. This avoids the two step process of push then pull, and simplifies it to push-but-only-where-necessary.

If I’m missing something critical to understanding the advantages of a push-the-pull model over a push model which tracks the active portions of the graph for optimizations, then please share that insight.

samholmes commented 3 weeks ago

It would seem that “effects” control the pulling of the graph when they are “mounted”. Another way of saying this is that the system is push based to only the active parts (the pulled parts). This would not require a “dirty flag” but instead an active flag which is pulled by effects. The system would be pushed to only the active portion of the graph (one trip, instead of a round trip with the push-then-pull model).

It seems like this is just a lazy push system, where pushes only propagate when the paths are activated by some effects at the end of the graph. I believe this is how Preact works, but don’t quote me on this.

If an effect mounts (activating a portion of the graph) and then it unmounts and is garbage collected, then if source nodes are weakly held references, the cleanup would cascade up to where the root nodes may be held in memory (which may be capturing some events). So, it would be clear that resource management happens at the root nodes, and not the endpoints. The endpoints are held in memory for enabling/disabling effects in our program, but ultimately they inherit their life cycle from their sources. The top-most source is a root. As long as a root exist in memory (perhaps as an event listener capturing events in our program) then its sources live with it only where the effects are instantiated. If a root node is GC'd, then the entire sub-graph depending on it should be GC'd as well, regardless as to whether the effects were "unmounted". Why? Because why persist an effect which will no longer be re-computed from any new state change?

This is exactly how Flash handles the graph: all source references are held weakly, and all target references (sinks) are held strongly. If a root is removed, all strong references to targets are removed. If no other strong references are held to these targets either by other source (sink) nodes, or by some JavaScript reference to the node, then they're scheduled for cleanup by the GC. This cascades down to effects, where they'll be automatically cleaned regardless of any unmounting.

aigan commented 3 weeks ago

A batch of changes will invalidate lots of dependencies. Having the dirty flag can save a lot of evaluation. Effects should only happen at the end of transaction.

samholmes commented 3 weeks ago

A batch of changes will invalidate lots of dependencies. Having the dirty flag can save a lot of evaluation. Effects should only happen at the end of transaction.

Can you be a bit more exhaustive with your point here? I'm having to read between the lines of what you're saying; some examples could help.

My question was about justifying push-then-pull model outside of it being novel or having a narrow case for optimization. I tried to expand on the case where there would be optimizations which would be only when the graph is not used. When the graph is not used, then there is remains the low overhead of pushing dirty flags. The purpose of an unused graph is to maintain idle code paths which may later become active by mounting an effect. Though, why not then just mark the idle code paths as idle and not propagate a dirty flag at all? When the code path becomes active by a mounted effect, then the push of changes propagate without any need to check for dirty flags. This means the model is a lazy push model and achieve the same result if not a better result (less unnecessary steps).

I haven't mentioned batching in this issue, so I'm curious why it's brought up? No batching occurs in a lazy-push model, unless you consider pulling that occurs once when mounting an effect? Again, can't read between the lines of what you're saying, so please clarify.

shaylew commented 2 weeks ago

My understanding of the push-then-pull model is that dirty flags are pushed first through the graph, then only signals which are attached to some watcher endpoints (effects) are then pulling the graph where it’s dirty. The pulling starts at the beginning of the graph and the state changes propagate down the dirty paths until either reaching the end of the graph or a node where the state hasn’t changed (a === comparison).

I think you're mistaking the "pull" part as a single distinct phase of the algorithm, but it isn't really. The pull happens when user code calls .get on a Computed to demand an up to date value. This means different parts of the graph can be pulled at different times and on different schedules.

Some parts of the graph may not end up getting pulled at all, if other changes make them irrelevant -- e.g. if they're only consumed in one branch of an if. As far as I know, no model that pushes changed values forward can avoid unnecessary recomputation in these cases; they only discover that parts of the graph turned out to have become unused after they've already recomputed them.

The basic guarantees of the push-then-pull model are:

  1. Watchers get notified when a signal they're watching may have changed because something was set
  2. Computeds are lazy, and only rerun when someone calls .get
  3. Computeds behave identically (in the timing and the number of times they run) whether or not a watcher depends on them.

That is:

Checking the dirty flag on a Computed if it's "live" is also an optimization in a similar vein. It doesn't change the outcome of when or which computeds rerun (guideline 3). It's just that if a computed does happen to be in the live part of the graph and you see it's not marked dirty, you know (without having to look at them) that none of its dependencies could possibly have changed.

samholmes commented 2 weeks ago

@shaylew Thanks for the thorough explanation. I can see why the model can be beneficial as I reason about your explanation.

What point stood out to me was the mention of consumers that no longer need to pull a path by the change of a condition in a branch (if statement). You mentioned how you're unaware of how a push-model could avoid unnecessary re-computation, but I suppose this depends on order of events. Let's get into an example:

const computed = new Signal.Computed(() => a.get() ?  b.get() : c.get)

In this example computed will evaluate b if a is truthy, otherwise it'll evaluate c. In a push model, evaluating b can be avoided if the computed is evaluated before an update to b where a.get() === false. So the order of sets matters:

a.set(false)
b.set(val)

In this case, an update to b in the push model will not impact computed evaluation. But, that's not what is of interest; what is interesting is when b is set before when a is set.

b.set(val) // causes `computed` to re-evaluate `b`
a.set(false) // causes `computed` to re-evaluate `c`

In that case, the push will cause computed to re-evaluate in a push-based system. However, how would this be avoided in a push-then-pull system? Would the pulling start after b is set, or is there a batching process of some kind? If the former, then push-then-pull wouldn't give us any benefit. Does this example miss the point?

It seems to me that the cases here really are based on the order of changes in the system and the races for those changes to whatever batching may be implemented for optimizations. All of that said, you're going to get the performance benefits of the system you're given, and the next question will be what cases will you need to be more explicit in how you use of the system to gain the optimizations that are relevant for the cases for your app.

samholmes commented 2 weeks ago

Untitled-2024-02-29-1156

I'm trying to contrive an example where push-than-pull leads to some optimization when it comes to re-evaluation, but I only see added complexity.

If you look at the example, the pull system simply triggers depth-first re-evaluations down the graph. During re-evaluations, we can assume the current cached state in each node is the latest state since we're pushing changes down the graph (from left to right). After each node is re-evaluated, its state is updated (yellow to blue). After its update it cascades to another push. There is never a need to track a "dirty" flag since updates are pushed immediately. Any value "pulled" during evaluation is taken directly from the cached state in the node which should be up-to-date for all parent nodes.

I'm looking for an example where an optimization can be had. I can only imagine an optimization case if the first phase (push) was able to allocate multiple updates of root-nodes before end-nodes pulled values down as they re-evaluate. Which is why I ask about a "batching process of some kind".

shaylew commented 2 weeks ago

So, two things:

Some kind of batching process is the norm for UI use cases. The proposal doesn't explicitly need to include this as a named feature because it leaves all the pull timing to user or framework code. Effectively if you call set on a variety of signals and then get a Computed, you've done a series of (non-batched, but deduplicated) notification pushes and then pulled a value from a computed that may have had multiple different inputs change since the last time it ran. If you couldn't do this sort of batching, any operation that changed many states would result in a large number of wasted recomputations whose results would never be rendered because they'd be obsoleted again before the next frame.

The second thing is: when you say

During re-evaluations, we can assume the current cached state in each node is the latest state since we're pushing changes down the graph (from left to right).

it turns out that "from left to right" is doing a lot of work here, and depth-first and left-to-right are different things. For an example of where the naïve push-based approach goes wrong, think about a graph like this: image Imagine that C is currently true, so the dotted edge isn't present yet (the node on the right didn't ask for G last time it ran). Now you change S in a way that causes F, G, and C to all have new values.

So if you want a working push system:

You can come up with coherent answers to these issues, and you end up with something like Jane Street's Incremental. But the thing you end up with isn't compatible with autotracking as far as I know, and it just bites the bullet on the "unnecessarily recomputing F" front and decides to live with that issue. Autotracking is one of the main reasons for this being a language proposal, so I don't think this sort of topologically-sorted push system is a good fit for its goals.

samholmes commented 2 weeks ago

So now I realized that the spec leaves the behavior of how the graph gets pulled up to the watchers (user-land). This gives flexibility of optimizing the way the graph is read / pulled. So, a user of the graph must be aware of which tool they're using to subcribe to the graph. Very interesting, and makes really good sense for the motivation.

Though, all the benefits for this model which leaves the pulling up to how watchers are implemented comes with the overhead of managing a little bit of extra state (dirty flag) and an extra few trivial calls between pulls on the graph (the real evaluation of the state). I'm curious at what case is the extra work a worthwhile trade-off? Is there any analysis on this trade-off specifically?

Given a naive watcher which pulls on every notification from the graph, it wouldn't be more optimal than a push-system. So what are some rudimentary watcher implementations which give you some net gain in optimization and at what minimum topology and update frequency would the gains begin?

samholmes commented 2 weeks ago

Given this spec moves the responsibility of how to most optimally read from the graph to the user-land, have we considered the alternative approach to the problem: moving the responsibility of how most optimally update (push) the graph to the user-land?

This is the basis of my questioning for the push-than-pull model. With the spec, we consider all the strategies of batching, and what not, to be done at the watcher, which sits between two stages push and pull. The alternative to this, that i'm willing to explorer for the sake of this proposal, is to allow for strategies to be implemented at any node in the graph. This can be done with a push-based system where each node tracks some state and can reduce over its previous state while evaluating (see Flash Reducers). And with such a capability, the entire architecture of push-then-pull model could be opt-in with an user-land implementation of such node types. What's beneficial about this is that you can then apply this strategy to only parts of your graph and mix and match other strategies as well. I'll have to contrive an example soon to illustrate.

samholmes commented 1 week ago

This is just some chicken-scratch code to illustrate how a push model, which can access some execution context values, could fully emulate the push-then-pull model.

const onPubSub = compute => {
  const computed = on(() => compute())
  // This is the computed signal derived from the compute function.
  // It will auto-track all of the signals invoked in the compute function.

  const dirty = on(() => computed() !== off(cached))
  // Track whether the computed becomes dirty by comparing it to the cached value.

  const cached = on(() => {
    const { cache } = own() ?? {} 
    // Get the cached value from internal state (or default to undefined)

    if (dirty()) {
      const newCache = when('pull') ? off(compute) : cache
      // Only recompute the value if the current execution context contains
      // 'pull', otherwise get the cached value.

      own({ value })
      // Always update the internal state to propagate dirtiness.
      // By setting a new object always, the signal will re-evaluate and propagate 
      // a change to sinks in the graph.
    }

    return cache
    // return internal cached value.
    // Signals can act like "views" over internal state in this way.
    // Even though cache may not have changed, the internal state change can force
    // push changes downstream.
  })
  // This is a signal that caches and returns the computed value.
  // Initially, the cache is undefined, which will result in `dirty() === true`.
  // Pulling while `dirty() === true` will recompute the value and update the cache.
  // The `cached` signal updating will _not_ invoke the dirty signal because it
  // doesn't track `cached` by use of `off(cached)`, avoiding cycles
  // In summary, cached only recomputes for the value when pulled and the signal
  // tracking the compute value is dirty.

  return cached
}

const source = on<number>(initialValue)
// Create some regular non-pubsub signal as a source.
// This to illustrate compatibility with regular signals.

const square = onPubSub(() => source() ** 2)
// Create our first pub-sub signal which is the square of the source signal.
// Internally a signal is tracking the source state, but only to update the
// internal dirty state. The dirty state is what propagates as a push through
// the signal graph.

const b = onPubSub(() => square() / 2)
// Create our second pub-sub signal which is half of the square.
// It is tracking another pubsub signal so it will only receive dirty pushes.

// Usage:
const effect = on(() => {
  b(false)
  // Track `b` without pulling it.

  // Subscribes to dirty notifications with `when` utility which returns true
  // when the signal is what caused the re-compute for this signal.
  if (when(b)) {
    // ...do extra logic to determine whether to pull `b` or not...

    b.with('pull') // pull!
    // Step 1:
    // `b.with('pull')` will pull the compute function for `b` if `b` is dirty.
    // This is done by adding 'pull' to the execution context stack.

    // Step 2:
    // `compute()` will re-evaluate `square() / 2` which will pull `square()`
    // if its dirty.

    // Step 3:
    // `square()` will re-evaluate `source() ** 2` which will pull `source()`.
  }
})()

We first define an abstraction for the push-than-pull signal called onPubSub. It will push "dirty" state. It accepts a compute function as you'd expect for a Computed signal, however it only uses the compute function to auto-track during graph initialization and when explicitly pulled by watchers. This special "pull-mode" can trivially be implemented with access to some "execution context" within the signal, this is illustrated with the when hypothetical API.

In this example, when(...x) will return true if x is within the execution context stack. This execution context contains the an ordered stack of signals which were invoked prior to the current signal invocation. This allows a signal to have context over which path in the graph has been traversed to result in a re-invocation for the signal. This is particularly useful for redux-like reducers which could change signal state depending on which action signal had re-evaluated. In addition to this use-case, a theoretical .with(...x) method is implemented on signals in order explicitly invoke a signal under some specific context defined by the caller. This means you can add arbitrary closure to signal compute functions.

Alternatively, one could implement the same execution context trick for changing signal evaluation modes by defining a impure compute function with the context parameterized:

let isPushingContext = false // at module scope; shared with all onPubSub instances

// Inside the onPushPull implementation:
const cached = on((push) => {
  // set the context value to match the push parameter if it's set.
  // Otherwise, set the push parameter to the current state of the context.
  // Evaluate the compute function if `push === true` and `dirty()`...
  // If push argument was given, reset the isPushingContext state in a setImmediate
})

Though this attempt is more hacky and error prone than being provided with an execution context API built in.

Conclusion

I want to illustrate the point here that a purely push model is all the that is needed to define other models at the user-land level. This is case for defaulting to the push-model which is the missing model in the JavaScript language; we already have a pull-model with function invocation and iterators. We have a push model with promises for single values, but we don't have a standard push model for multiple values standardized yet. Observables was the original attempt at this initiative. I think Signals are a better and more refined concept to take the place of the multiple-value push-based primitive than Observables for the main reason of auto-tracking and minimalist API and semantic constructs. Over-complicating their implementation without any means of opting-out of standard behavior, would miss the opportunity to cover all applications which simply need a multiple-value push-based primitive.

Exhausting my case for what signals should be

So far, my multi-year research on the topic of signals (or observables, or reactors) has lead me to the conclusion that all that is missing is the ability to define functions which are invoked implicitly instead of explicitly: a function f can be invoked by the invocation of another function g. These "reactive functions" are all that is needed to achieve state pushes. Typically function invocation call-stacks start by invoking the dependent f before invoking the dependency g; this is state pulling. By inverting the function invocation call-stack, we get state pushing: invoking g first to get its value before invoking f. Of course, the system can cache the g return value after invoking it before evaluating f. Caching can be done during evaluation time just like a regular call-stack would stack the execution contexts, or long-term caching can be opt-in for compute-time optimizations in exchange for decreased memory-optimizations (just like one might chose caching where its needed in a regular call-stack).

By realizing that the push system is just an inversion of the call-stack, it becomes the obvious answer to the missing push state primitive in any functional language (including JavaScript). Whether this kind of reactive primitive fits the specification for what defines a "signal" or not, remains irrelevant because it is at least more pure to the concept of a reactive primitive standard from which all other reactive systems could derive.

modderme123 commented 1 week ago

It's true that a pull is just two pushes. However, one of the goals of this proposal is to ensure that we have interoperability between frameworks that use Signals with the guarantees that Shay mentioned above. I think requiring people to build custom push protocols to emulate pull messages makes interoperability between frameworks harder.

samholmes commented 1 week ago

@modderme123 By my point isn't that a pull is just two pushes. I shall look at the context you've provided to understand what is meant here by pull is two pushes.

Instead, my point is that the language already has 'pull' mechanisms. And introducing a push mechanism can enable any protocol desired in the graph. The push-then-pull model is simply a protocol within the graph that may be interoperable with most signal libraries, but certainly it's not interoperable with all of them. If only a push model is standardized, then it is possible for the standard to be interoperable with all reactive frameworks whether signal-based or observable-based.

For example, there is no way for this signals proposal to be used as the implementation details for Flash, because Flash, like observables, is push-based. Therefore, in order for Flash to be interoperable with other libraries which consume a standard reactive primitive such as signals, then Flash would need to compromise on the design entirely. There may not even be ways to achieve the design decisions in Flash using the standard. This would render the signals standard useless for Flash's application. Which is unfortunate, because then the language could become fragmented with a standard for reactivity, and many identical user-land reactivity implementations that only require a push-model. That is until the push-model primitive is standardized as something else, and then we'd have two primitives in the language and a bit of redundancy.

samholmes commented 1 week ago

So if you want a working push system:

  • You had better find a way to make sure you've computed the latest value of C in time for it to be consumed in the if. Neither the shortest path (BFS) and the first path (DFS) necessarily cut it. You need a left-to-right ordering that's compatible with the topological sort.
  • G wasn't computed last time, but now that C is false it's going to be necessary. You're only going to discover this while you're in the middle of running the node on the right.
  • F was computed last time, but it's not actually needed anymore. Nothing to the left of F is going to help you discover this.

After re-reading this, I wonder: How does a system which pushes "dirtiness state" down the graph solve for this problem? The watcher will receive a notification that a node is dirty before the change has fully propagated the dirtiness state through the graph, right? Unless the push phase of the dirty flag happens in one transaction and then all relevant watchers are called once this phase has completed?

shaylew commented 1 week ago

After re-reading this, I wonder: How does a system which pushes "dirtiness state" down the graph solve for this problem? The watcher will receive a notification that a node is dirty before the change has fully propagated the dirtiness state through the graph, right? Unless the push phase of the dirty flag happens in one transaction and then all relevant watchers are called once this phase has completed?

This is essentially the reason you're not allowed to get a node during a watcher callback, and are supposed to schedule a reaction instead. A two-pass model could also work. The important thing is you can't read the values of nodes while propagation is still happening, or else Computeds can't guarantee an up-to-date, correct value.