Closed DavidUdell closed 2 months ago
Remaining todos here are:
Node level stuff is done. All that remains for the gradient-based implementation is the edge-level stuff.
Edge level stuff done, excluding MLP-out and attn-out.
Remaining todos here are: