haoliuhl / ringattention

Transformers with Arbitrarily Large Context
Apache License 2.0
630 stars 50 forks source link

improved llama sharding #7

Closed haoliuhl closed 1 year ago