Open Summer-sss opened 3 years ago
Hello, may I ask whether the parameters are shared between the multiple Transformer blocks? If not, how can parameter sharing be achieved? Thank you very much for your reply.
The parameters are not shared across the Transformer blocks. The blocks are generated in a for-loop, so each iteration creates a separate block with its own parameters.
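To make the difference concrete, here is a minimal, framework-free sketch. It is not the repository's code: `Block`, `build_unshared`, and `build_shared` are hypothetical names used only for illustration, and `Block` is a stand-in for a real Transformer block. Assuming the model builds its blocks in a loop as described above, the first function mirrors that behavior. The second shows one common way to share parameters, which is to create a single block and reuse it at every depth.

```python
import random


class Block:
    """Hypothetical stand-in for a Transformer block that owns its weights."""

    def __init__(self, dim):
        # Each instance allocates its own, independently initialized weights.
        self.weight = [random.gauss(0.0, 1.0) for _ in range(dim)]


def build_unshared(depth, dim):
    # A new Block is constructed on every iteration, so no parameters are shared.
    blocks = []
    for _ in range(depth):
        blocks.append(Block(dim))
    return blocks


def build_shared(depth, dim):
    # One Block is constructed once and reused at every depth,
    # so all layers point at the same parameters.
    block = Block(dim)
    return [block for _ in range(depth)]


unshared = build_unshared(depth=3, dim=4)
shared = build_shared(depth=3, dim=4)

# Count distinct block objects in each stack.
print(len({id(b) for b in unshared}))  # 3 distinct blocks
print(len({id(b) for b in shared}))    # 1 block reused three times
```

In a framework such as PyTorch the same idea would likely apply: constructing the block module inside the loop gives independent weights, while calling one module instance repeatedly in the forward pass shares them.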