LuLuLuyi / LongHeads

LongHeads: Multi-Head Attention is Secretly a Long Context Processor
27 stars, 1 fork