Closed OliverFM closed 1 year ago
This improves the linear layer slightly: it changes `forward` to use the same matrix multiplication order as torch's linear layer.
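
For reference, `torch.nn.Linear` documents its forward pass as `y = x @ W.T + b`, with the weight stored as `(out_features, in_features)`. The following is a minimal pure-Python sketch of that multiplication order, not the code changed in this PR; the function name `linear_forward` and the sample values are hypothetical and chosen only for illustration.

```python
def linear_forward(x, weight, bias):
    """Compute y = x @ weight.T + bias using plain Python lists.

    x:      shape (batch, in_features)
    weight: shape (out_features, in_features), the layout torch.nn.Linear uses
    bias:   shape (out_features,)
    """
    return [
        # Each output feature is the dot product of an input row with one
        # weight row (i.e. one column of weight.T), plus that feature's bias.
        [sum(xi * wi for xi, wi in zip(row, w_row)) + b
         for w_row, b in zip(weight, bias)]
        for row in x
    ]


# Hypothetical sample data: one input row, 3 input features, 2 output features.
x = [[1.0, 2.0, 3.0]]
weight = [[1.0, 0.0, 0.0],
          [0.0, 1.0, 1.0]]
bias = [0.5, -0.5]

y = linear_forward(x, weight, bias)
print(y)
```

With this layout the input is on the left of the product and the output has shape `(batch, out_features)`, so a layer that follows the same order can reuse weights laid out the way torch lays them out without transposing them first.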