Open inspir3dArt opened 4 days ago
A few questions:
Is Bypass Context Length in settings disabled?
I haven't tested 0.8.0, but it was a problem back in v0.7.9g too, which is the last version I think I tested before.
Yes, it is disabled. The model showed the context length correctly as 8192 in the models overview, but was at a lower setting in the model settings by default, so I changed it to 8192 there too.
The following fields are used / enabled by the character card:
The model card also has entries in the "System Prompt" and "Jailbreak (Post history instructions)" fields that seem not to be used. Before starting the roleplay, I manually copied the System Prompt text to the beginning of the Description field in the character card edit menu in ChatterUI.
I had a longer roleplay chat using 0.8.1 with a Q4_K_M GGUF quant of L3-8B-Lunar-Stheno. It worked well until message #42, where it took 19 minutes before it replied, as if it had to reprocess the entire chat. The model's context size is set to 8192.
The log doesn't show anything different from the other replies, except for the big time jump.
The device is a Samsung Galaxy S24 Ultra.