mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
https://arxiv.org/abs/2309.17453
MIT License
6.38k stars, 355 forks
Support mistral-7b? #75
Open
spring1915 opened 6 months ago