lucidrains / local-attention

An implementation of local windowed attention for language modeling
MIT License
370 stars 40 forks source link

question about the local attention #1

Closed benywon closed 3 years ago