NolanoOrg / cformers

SoTA Transformers with C-backend for fast inference on your CPU.
MIT License
311 stars 29 forks source link

GPT-NeoX and Pythia Style models (Open-Assistant) at int-4 #14

Closed Ayushk4 closed 1 year ago