Open Chengwei-Yan opened 5 months ago
Dear author, I have a question. Normally, TransformerEncoderLayer processes three-dimensional data, but I observed that it appears to be processing two-dimensional data here. Why does this approach work effectively?
Dear author, I have a question. Normally, TransformerEncoderLayer processes three-dimensional data, but I observed that it appears to be processing two-dimensional data here. Why does this approach work effectively?