Open icecream0 opened 4 years ago
I think all inputs are treated as two dimensional, so the framework can generalize to images easily.
I think all inputs are treated as two dimensional, so the framework can generalize to images easily.
maybe,thanks a lot. transformer mainly used for nlp, so the design of framework make code reader more difficult
Description
Transformer inputs [batch_size, input_length, 1, hidden_dim], I want to know why inputs shape is not [batch_size, input_length, hidden_dim], have some reasons?please tell me ,thank you ...
Environment information
For bugs: reproduction and error logs