Closed richardbaihe closed 4 years ago
This paper proposes the chunked self-attention with a global memory module.
This paper proposes the chunked self-attention with a global memory module.