Closed davidweichiang closed 3 years ago
Branch https://github.com/namedtensor/notation/tree/dual shows what some of these options look like for RNN, self-attention, and MVN.
I think the two-name contraction operator might have won. It's simple and it eliminates the most renamings (all but one). It just doesn't look very nice, but hopefully it wouldn't be used too much.
Do any of these look better?
@srush @boazbk
The first is definitely my favorite.
Some people like that section though :)
I also prefer the first!
First meaning matrix-shaped axes, or the two-name contraction?
I mean that
Is a good choice for dot product where we match the ax1 axis of A with the ax2 axis of B
(also what I meant)
Ah, ok. And among the different solutions, do you like that two-name contraction the best too?
Yes. Its not the most interesting, but I think it fits our goals.
Section 6.2 deals with the problem, most visible in RNNs, where you want a linear transformation from an axis to itself, which currently requires renaming. There are four solutions proposed:
and a fifth solution would be:
I think I'd like to cut this down to just one, and to make it part of the main document.