Open nkotak opened 7 months ago
I did find a thread about this, but I'm wondering why it would occur, since I'm running this on an Apple Silicon machine with 96 GB of RAM and the models are all 34B models. Could the number of models be causing the issue?
Have you tried increasing the maximum amount of memory allocated to your GPU? I think by default macOS only lets the GPU use about half of your RAM. A 34B model takes at least 68 GB at 16-bit precision (2 bytes per parameter), plus some overhead for context length. I think you should be able to get it to work if you play with the setting, but I can't say for sure as I don't have a comparable Mac to test with. Here's a post with some info on how: https://old.reddit.com/r/LocalLLaMA/comments/186phti/m1m2m3_increase_vram_allocation_with_sudo_sysctl/
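To make the arithmetic concrete, here's a rough sketch of the estimate. The function names, the 8 GB reserve for the OS, and the "half of RAM" default are my own assumptions for illustration, not measured values, and the sysctl key mentioned in the comment is what I understand the linked post to describe, so check it against your macOS version before running anything.

```python
def weight_memory_gb(params_billion: float, bytes_per_param: float = 2.0) -> float:
    """Approximate memory for the model weights alone, in GB (1 GB = 1e9 bytes).

    Context/KV-cache overhead is NOT included, so treat this as a lower bound.
    """
    return params_billion * bytes_per_param


def suggested_limit_mb(total_ram_gb: int, reserve_gb: int = 8) -> int:
    """Hypothetical helper: a GPU memory limit in MB that leaves `reserve_gb`
    for the OS and other apps. The 8 GB reserve is an assumption, not a rule."""
    return (total_ram_gb - reserve_gb) * 1024


total_ram_gb = 96
needed_gb = weight_memory_gb(34)        # 34B parameters at 2 bytes each
assumed_default_gb = total_ram_gb / 2   # assumption: about half of RAM by default

print(f"weights need about {needed_gb:.0f} GB")
print(f"assumed default GPU limit is about {assumed_default_gb:.0f} GB")
print(f"fits under assumed default: {needed_gb <= assumed_default_gb}")

# Assumption: on recent macOS the limit is set with something like
#   sudo sysctl iogpu.wired_limit_mb=<value>
# Verify the exact key for your OS version in the linked post first.
print(f"candidate limit value: {suggested_limit_mb(total_ram_gb)} MB")
```

If the default really is around half of RAM, a single 34B model at 16-bit precision already doesn't fit under it on a 96 GB machine, which would explain the error regardless of how many models are involved.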
I received the error below and I'm not sure how to proceed. Thank you!