RNN problem - Githubissues

namedtensor / notation

108 stars 5 forks source link

RNN problem #29

Closed davidweichiang closed 3 years ago

davidweichiang commented 3 years ago

Section 6.2 deals with the problem, most visible in RNNs, where you want a linear transformation from an axis to itself, which currently requires renaming. There are four solutions proposed:

matrix axes (§6.1.3)
contracting two names (§6.2.1)
stars (§6.2.2)
numbered axes (§6.2.3). I really liked this but I think everyone agrees that it's not very self-explanatory

and a fifth solution would be:

adopt a style that avoids reusing axis names; even the timesteps of an RNN would have distinct names

I think I'd like to cut this down to just one, and to make it part of the main document.

davidweichiang commented 3 years ago

Branch https://github.com/namedtensor/notation/tree/dual shows what some of these options look like for RNN, self-attention, and MVN.

I think the two-name contraction operator might have won. It's simple and it eliminates the most renamings (all but one). It just doesn't look very nice, but hopefully it wouldn't be used too much.

davidweichiang commented 3 years ago

Do any of these look better?