WebAssembly / gc

Branch of the spec repo scoped to discussion of GC integration in WebAssembly
https://webassembly.github.io/gc/
Other
1.01k stars 73 forks source link

Post-MVP feature requests #452

Open tlively opened 1 year ago

tlively commented 1 year ago

These are the feature requests that came up at the subgroup meeting on October 3, 2023:

  1. Non-throwing variants of instructions or trap handlers
  2. Constant array.new_elem and array.new_data
  3. initialize-once globals
  4. Feature detection for experimenting with new features
  5. Type imports
  6. Controlling meta-objects/RTTs/headers
  7. Static fields
  8. Weak references
  9. Finalization
  10. Eliminate method receiver casts
  11. Ergonomic JS/WasmGC interop
  12. Arbitrary multi-byte loads and stores to byte arrays (#395
rossberg commented 1 year ago

You meant October 12? :)

Many of those are already listed in the Post-MVP doc in some form, I'll add the others.

  1. initialize-once globals

I don't remember this one. What does it mean exactly?

zapashcanon commented 1 year ago

With @chambart we're going to present our freezable/sealable proposal at the next CG meeting.

It allows to have mutable and/or nullable globals/values only during initialization phase and they become immutable and/or non-nullable after this.

initialize-once globals

It may be one of the things we are proposing.

fitzgen commented 1 year ago

Another post-MVP feature to consider would be a way to compare funcrefs for equality and/or get unique IDs for them or something.

Then I could do speculative inlining of funcrefs as a Wasm-to-Wasm pass in Winliner instead of only for call_indirects through tables that are never mutated.

jakobkummerow commented 1 year ago

a way to compare funcrefs for equality

I agree that this would be useful to have.

Implementation perspective: if we expect that repeated creations of funcrefs for the same function index have distinct identity (in other words, that (ref.eq (ref.func $x) (ref.func $x)) evaluates to 0), then that would be annoying/inefficient to implement. If we specify that funcrefs are implicitly cached/canonicalized, i.e. there is only one unique funcref returned by arbitrarily many executions of (ref.func $x) for a given $x, then that would be trivial to implement -- in fact, we already make use of this property (and pointer-equality comparisons based on it) internally in V8. (To be fair, I cannot say with certainty whether other engines might see this differently, but I would find that somewhat surprising: the obvious way how to make ref.func fast is to cache or even pre-initialize its result, so it doesn't need an allocation every time.)

rossberg commented 1 year ago

One of the known problems with function identity is that it invalidates optimisations like specialisation or any code transformation duplicating/merging function definitions that are referenced.

kripken commented 1 year ago

Previous discussion on function comparisons: https://github.com/WebAssembly/gc/issues/239 , some use cases and data there from a year ago.

Overall I think this can help in the toolchain, as @fitzgen said, but I also agree with @rossberg that the downsides need to be carefully weighed here. Another thing to consider is that if VMs inline at runtime then this probably becomes less important, and for that reason I don't consider this high priority personally.

fitzgen commented 1 year ago

Another thing to consider is that if VMs inline at runtime then this probably becomes less important

FWIW, not all engines / Wasm environments have (or will ever have) JIT capabilities.

wingo commented 3 months ago

For what it is worth, it seems the JS bindings specify that two funcref instances for the same func in the same instance are eq: https://webassembly.github.io/gc/js-api/index.html#exported-function-exotic-objects. Though, I do understand why Wasm might want to avoid closure equality.

Still, for Hoot, I would like to be able to distinguish between different funcrefs; besides specialization, it would be useful for debugging and introspection. All I really need though is to be able to get an i32 associated with the funcref's code.

Perhaps there would be a way to keep funcrefs as non-comparable by default, but allow a module to declare some set of functions as having visible addresses, possibly via designating an element section, where addresses are assigned sequentially starting from 0. Then you could have func.get_addr retrieve the address for a funcref. If the funcref is not from the current instance (module?) it would return -1 or something. Could this be sufficient for the devirtualization use case?

In Hoot probably we have to build something terrible, like putting all funcrefs in a host-side hash table or something. A Wasm facility would be quite welcome.

Edit: Turns out, I need to distinguish between instances; given an open-world system with instances A and B, sequential allocation from 0 within a module doesn't do it. Still working on a solution; I can wrap everything in a struct but that is quite some overhead for a basic ability which is present under the hood (equality and hashing).

rossberg commented 3 months ago

@wingo:

for a basic ability which is present under the hood (equality and hashing).

Well, the reason for not exposing this is that it in fact may not be present under the hood, e.g., due to optimisations etc.

But yeah, this feature is already listed in the Post-MVP doc.

My preferred mechanism would be something a bit more generic, where we allow type definitions to specify whether references to them support equality or not. For functions, this would allow knowing at definition site whether a function may need to be handled carefully to preserve equality. For structs, defining them as non-eq would essentially give us value types. (I thought this was somewhere in the Post-MVP doc, but apparently I never included it there.)