explosion / curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components
MIT License
864 stars 34 forks

Convert QKV projection splitting methods into Torch modules #343

Status: Open. Opened by danieldk 1 year ago.