Closed: inexorabletash closed this 8 months ago
One thing that wasn't discussed in #572 in detail is: what to do with `MLActivation`-vending methods? This PR currently simplifies the "create an MLActivation" steps so that they don't throw, on the assumption that the caller will validate. But none of the invocations do any validation (except `clamp()`), and they all include "If that throws an error, re-throw the error." (except `elu()`).
I think the right answer is: validate in the caller (i.e. the actual method), just like the `MLOperand`-vending methods. So drop the "re-throw" steps (as `elu()` already does), and add validation steps where needed (as `clamp()` already does). But confirmation would be appreciated.
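To make the proposal concrete, here is a minimal, purely illustrative TypeScript sketch of the two patterns being compared. The class and option names (`MLActivation`, `MLClampOptions`, `createActivation`) echo the spec, but the code is a hypothetical model of the algorithm steps, not the actual implementation:

```typescript
// Hypothetical model of the proposed pattern: the MLActivation-vending
// method validates its own options, so the shared "create an MLActivation"
// steps are infallible and callers never need "re-throw" steps.

interface MLClampOptions {
  minValue?: number;
  maxValue?: number;
}

class MLActivation {
  constructor(readonly name: string, readonly options: object) {}
}

// "Create an MLActivation" steps: simplified so they cannot throw.
function createActivation(name: string, options: object): MLActivation {
  return new MLActivation(name, options);
}

// The clamp() style: validation lives in the caller (the vending method),
// mirroring how MLOperand-vending methods validate before building.
function clamp(options: MLClampOptions = {}): MLActivation {
  const min = options.minValue ?? -Infinity;
  const max = options.maxValue ?? Infinity;
  if (min > max) {
    throw new TypeError("minValue must not exceed maxValue");
  }
  return createActivation("clamp", options);
}

// The elu() style: nothing to validate, so there are no throw steps
// and no "If that throws an error, re-throw the error." step either.
function elu(): MLActivation {
  return createActivation("elu", {});
}
```

Under this split, the "re-throw" steps in the other invocations become dead weight and can simply be dropped.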
Another slight tweak we might want to make here: there are multiple styles used for declaring the operator. Here are all the distinct examples:

I'm not sure it really matters, but the "gru" style stands out as the odd duck. That said, there's something to be said for quoting the name, since otherwise e.g. this reads strangely: "Let operator be an operator for the where operation..."
Spelling is also all over the place, e.g. "Leaky RELU" vs. "LSTM cell" vs. "Gather" vs. "instance normalization" vs. "batchNormalization".
> there are multiple styles used for declaring the operator
That's probably all due to my sloppiness while mass-defining those algorithms at different times and fatigue levels. :) Let's choose one style and apply it consistently.
Thanks @inexorabletash! I am not feeling well today. I'll catch up ASAP.
Okay, sounds like this is ready for a merge - @fdwr maybe one last look?
@inexorabletash : 🔎👀 ETA 16:10... (Oxford comma for the clarity win 👍)
Great work everyone!
I believe we were just able to squeeze these changes into the CR Snapshot release train before it departs. I expect the programming model section to be read by people outside this WG, so improvements there were timely.
As discussed in #572:
Not covered in this change:
this
For #549 and #572.