Open 233function opened 3 months ago
I was wondering the same thing. I guess I'll have to study the source code to find the Llama-specific tricks.
In fact, Qwen's architecture is similar to Llama's, so you could follow the existing Llama implementation to support the Qwen family of models.
Hello, author. When will the framework support context-window extension for the Qwen series of models?