tomaarsen / attention_sinks

Extend existing LLMs well beyond their original training length with constant memory usage, without retraining
https://huggingface.co/blog/tomaarsen/attention-sinks
Apache License 2.0

chatglm3 support? #40

Open ScottishFold007 opened 7 months ago

ScottishFold007 commented 7 months ago

Your project is really great. Could you add support for chatglm3? https://huggingface.co/THUDM/chatglm3-6b/blob/main/modeling_chatglm.py
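
For context, a minimal sketch of what the requested support might look like, assuming chatglm3 could one day be loaded through the library's `AutoModelForCausalLM` drop-in. The `attention_sink_size` / `attention_sink_window_size` arguments follow the patterns shown in the attention_sinks README for already-supported architectures; chatglm3 support itself is exactly what this issue asks for and is not implemented, so this is purely hypothetical usage:

```python
# Hypothetical: this is the requested feature, NOT current attention_sinks behavior.
from attention_sinks import AutoModelForCausalLM
from transformers import AutoTokenizer

model_id = "THUDM/chatglm3-6b"

# chatglm3 ships custom modeling code on the Hub, so trust_remote_code is required.
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,
    device_map="auto",
    attention_sink_size=4,             # initial "sink" tokens kept in the KV cache
    attention_sink_window_size=1020,   # sliding window of most recent tokens
)
```

Supporting this would presumably require a chatglm-specific attention/KV-cache patch like the per-architecture ones the library already maintains, since chatglm3's `modeling_chatglm.py` does not follow the stock `transformers` attention layout.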