Open dinopio opened 4 days ago
AFAIK, you do not have to do 'ollama run model' to use it with Ottodev. As long as it has been downloaded and is listed in 'ollama list' it should work just fine.
When you run it, you put it in memory and then you load it one more time when you use it in Ottodev. Hence the double memory issue.
Hope this made sense.
This isn't what's happening. I showed the model loaded by Ollama to show the actual GPU memory used. When the UI loads it (clean), it's double the size.
Update:
I believe I've identified the root of the problem: OLLAMA_NUM_PARALLEL needs to be adjusted.
I am experiencing the same issue.
I have 2 x 3090 and 2 x A30. When I run bolt.new and select the qwen2.5-coder:14b model (ctx not modified), 'ollama ps' gives a result like the one below:
I think you have set OLLAMA_NUM_PARALLEL=2 in your Ollama environment. In your case, you need to set it to 1.
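For what it's worth, here is a rough sketch of why OLLAMA_NUM_PARALLEL would matter here. As far as I understand the Ollama FAQ, parallel request processing multiplies the context allocated for a model by the number of parallel requests, so a num_ctx of 13108 with OLLAMA_NUM_PARALLEL=2 behaves like a context of roughly twice that size. The function name effective_context is made up for illustration, and the sketch only counts context tokens; it does not try to estimate actual VRAM usage.

```python
def effective_context(num_ctx: int, num_parallel: int) -> int:
    """Total context tokens allocated for one loaded model.

    Assumption (taken from the Ollama FAQ): the per-request context
    is multiplied by the number of parallel request slots.
    """
    if num_ctx <= 0 or num_parallel <= 0:
        raise ValueError("num_ctx and num_parallel must be positive")
    return num_ctx * num_parallel


if __name__ == "__main__":
    # num_ctx taken from the Modelfile in this issue
    for parallel in (1, 2):
        total = effective_context(13108, parallel)
        print(f"OLLAMA_NUM_PARALLEL={parallel}: {total} context tokens")
```

With OLLAMA_NUM_PARALLEL=1 the allocated context stays at the Modelfile value, which would be consistent with the model fitting in VRAM again.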
Describe the bug
I have 2x3090, 48GB VRAM in total. Modelfile:
FROM qwen2.5-coder:32b
PARAMETER num_ctx 13108
These settings load a 46GB model, as seen below, but when I select qwen2.5-coder-extra:32b, it ignores this value and loads a different setup:
Steps to reproduce
Selecting it from the list gives this result, which uses RAM and CPU instead of the 46GB version:
Is there something I have missed in the Ollama setup?
Expected behavior
Load the expected 46GB model which was based on PARAMETER num_ctx 13108
Screen Recording / Screenshot
No response
Platform