lucidrains / FLASH-pytorch

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
MIT License

minor change to align with paper for better readability #3

Open chivee opened 2 years ago

chivee commented 2 years ago

I'm changing some local variables to align with the paper, for better readability.