lucidrains / routing-transformer

Fully featured implementation of Routing Transformer
MIT License
282 stars 29 forks source link

normalize queries and keys before dot product #10

Closed lucidrains closed 4 years ago