Open Qualzz opened 1 week ago
Could this be related? I use the QuantFactory/internlm2_5-7b-chat-1m-GGUF
quantized model with LMStudio.
Something seems to be broken with either the GGUF or the implementation, yup.
Hi @Qualzz which prompt are you using? I'm seeing okay results with that context window:
ollama run internlm2
>>> /set parameter num_ctx 32768
Set parameter 'num_ctx' to '32768'
>>> Write me a long story
**Title: The Echoes of Eternity**
In the heart of a realm where time flowed like water through veins, there existed an enigmatic place known as Eldoria. A land bathed in perpetual
twilight and whispered secrets by the silvery whispers of the moonbeams. Here, amongst towering crystal spires that pierced the heavens, dwell the
guardians—ancient beings who preserve the balance between life's myriad paths.
Let me know and I can look into this.
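To reproduce this with a long prompt outside the interactive session, the same num_ctx value can be passed through Ollama's REST API. This is a minimal sketch, assuming Ollama is listening on its default local port (11434) and that the model tag is internlm2 as in the session above. build_request and generate are hypothetical helper names used for illustration, not part of Ollama.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model="internlm2", num_ctx=32768):
    """Build the JSON payload for one non-streaming generation.

    num_ctx goes under "options", mirroring `/set parameter num_ctx`.
    """
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,  # return a single JSON object instead of a stream
        "options": {"num_ctx": num_ctx},
    }


def generate(prompt, model="internlm2", num_ctx=32768, url=OLLAMA_URL):
    """Send the prompt to the local Ollama server and return the generated text."""
    payload = json.dumps(build_request(prompt, model, num_ctx)).encode("utf-8")
    req = urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Calling generate() with a long document followed by a summarization request should show whether the output degrades at 32k context, which is the case described in this thread.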
I don't have an example right now, but it's similar to the screenshot posted by Eligop. It happens especially with long prompts and when asking for a summary, that kind of stuff.
What is the issue?
Seems like something is wrong with InternLM2.5, I can't get any meaningful output out of it (tried with 32k context).
OS
Linux
GPU
Nvidia
CPU
AMD
Ollama version
v0.1.48