lucidrains / x-transformers

A concise but complete full-attention transformer with a set of promising experimental features from various papers
MIT License
4.63k stars, 395 forks

add xval wrapper and autoregressive wrapper #195

Closed by lucidrains 11 months ago

lucidrains commented 11 months ago

https://arxiv.org/abs/2310.02989