Open Sazu-bit opened 8 months ago
When you see that it's using 4.10GiB out of 15.6GiB RAM, does that include the cache RAM usage? The only reason a system would start using swap is if the whole of the system RAM is depleted. The "mlock" parameter is in 1.53 also.
Have you tried seeing if it loads with fewer GPU layers?
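To help answer the cache question above, here is a minimal sketch that splits the numbers in `/proc/meminfo` into "used without cache", "cache" and "swap used". It assumes a Linux system; the function names `parse_meminfo` and `memory_breakdown` are made up for this example, and the formula is one common way to do the split, not necessarily what your system monitor shows.

```python
import os


def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of integer values (KiB)."""
    info = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines that are not "Key: value" pairs
        fields = rest.split()
        if fields and fields[0].isdigit():
            info[key.strip()] = int(fields[0])
    return info


def memory_breakdown(info):
    """Split memory into used-without-cache, cache, and swap used (all KiB).

    Assumption: cache = Buffers + Cached + SReclaimable. Other tools may
    count this differently.
    """
    cache = (
        info.get("Buffers", 0)
        + info.get("Cached", 0)
        + info.get("SReclaimable", 0)
    )
    used = info["MemTotal"] - info["MemFree"] - cache
    swap_used = info.get("SwapTotal", 0) - info.get("SwapFree", 0)
    return {"used_kib": used, "cache_kib": cache, "swap_used_kib": swap_used}


if __name__ == "__main__" and os.path.exists("/proc/meminfo"):
    with open("/proc/meminfo") as f:
        breakdown = memory_breakdown(parse_meminfo(f.read()))
    for name, kib in breakdown.items():
        print(f"{name}: {kib / 1024 / 1024:.2f} GiB")
```

Running it once before loading a model and once after shows whether the jump in "used" is real allocation or just page cache that the kernel can drop.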
I thought I was clear but apparently not. The 4.10G is my usual RAM load with no LLM loaded. I have 16GB in total. When I load a model such as MythoMax or Airoboros it goes up to 8.53GB (that is after I post the first message; before that it's 14.8GB. I have no idea why this happens, because it basically decreases and any swap used disappears). I have 8GB of VRAM so that's where all the extra expected memory is going. I'm using CLBlast since I can't use ROCm: ROCm installs to /opt and it's fecking huge (13GB), and I don't have enough space on my root partition (using 22/30GB) to accommodate it, otherwise I'd be using hipBLAS. This is true for both 1.53 and 1.57.
The plan is to use mlock. I am aware it's in 1.53, but because it's a new activity I wanted to make sure that I was running on the latest version, and I can't get the latest version to run.
I currently run quite happily in 1.53 with 38 layers and I don't see why I can't run the same thing in 1.57 with or without mlock enabled. I get the "not enough space" error either way. It looks like it's tied to the context: the error message suggests it needs 32768, but only 4096 is allocated. I have tried setting the context to 32768, but it didn't make a difference.
Yes I've tried loading the model in 1.57 with fewer layers, but I am getting the same error message.
For what it's worth I have an i7 processor, 16GB DDR3 RAM and 8GB VRAM. I can run this model just fine with https://aur.archlinux.org/packages/koboldcpp-clblast , but unfortunately that package is 1.53.
I am currently using 4.10G out of 15.6G, so I trust I have enough space in my RAM (normally when this model is running I'm using around 95% of my VRAM and an additional 5 or 6GB of RAM). The only reason I was updating was because I plan to use mlock: this thing keeps shifting to swap and my swap is really really really slow. It does it automatically despite having an additional 4GB to play with, hence the mlock.
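One thing worth checking before relying on mlock: on Linux, an unprivileged process can only lock as much memory as its `RLIMIT_MEMLOCK` limit allows. The sketch below reads that limit with Python's standard `resource` module and compares it against a model size. Whether this limit is what you would hit here is an assumption on my part, and the 6 GiB figure and the `fits_memlock_limit` helper are purely illustrative.

```python
import resource


def fits_memlock_limit(bytes_needed, soft_limit):
    """Return True if bytes_needed can be locked under the given soft limit."""
    if soft_limit == resource.RLIM_INFINITY:
        return True  # "unlimited": no cap on locked memory
    return bytes_needed <= soft_limit


if __name__ == "__main__":
    soft, hard = resource.getrlimit(resource.RLIMIT_MEMLOCK)
    # Hypothetical size of the part of the model kept in system RAM.
    model_bytes = 6 * 1024**3

    for label, value in (("soft", soft), ("hard", hard)):
        shown = "unlimited" if value == resource.RLIM_INFINITY else f"{value} bytes"
        print(f"RLIMIT_MEMLOCK {label} limit: {shown}")

    if fits_memlock_limit(model_bytes, soft):
        print("The locked-memory limit should allow locking the model.")
    else:
        print("The locked-memory limit is lower than the model size.")
```

If the soft limit turns out to be small, it can usually be raised for the current shell with `ulimit -l`, up to the hard limit.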
I've switched back to 1.53 temporarily but can still test with 1.57 if further guidance is needed. I'm not sure how to run the ptrace, would need some assistance on that.