Open vansin opened 8 months ago
https://huggingface.co/internlm https://github.com/InternLM/InternLM
I think you should ask the model makers or inference framework authros to support streaming-llm instead.
https://huggingface.co/internlm https://github.com/InternLM/InternLM