mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
https://arxiv.org/abs/2309.17453
MIT License
6.59k stars 361 forks source link

How to use streaming llm to train a new model? is there any sample code . thansk #45

Closed mega-cqz closed 11 months ago

Guangxuan-Xiao commented 11 months ago

Hello, StreamingLLM is designed for deploying LLMs with infinite-length inputs without compromising efficiency or performance. It is not intended for training LLMs.