Closed yxliu-ntu closed 4 years ago
when you forward(), the last layer of cin is not split half, while the fc_input_dim still adds cross_layer_size//=2.
when you forward(), the last layer of cin is not split half, while the fc_input_dim still adds cross_layer_size//=2.