-
### What is the issue?
If I try to run the `llama3.2-vision` model using `ollama run llama3.2-vision` on my Arch Linux machine, I get this error:
```
Error: llama runner process has terminated: GG…
```
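The message is cut off here; on a systemd-based distro such as Arch, the rest of the runner's output is usually in the service journal. A minimal sketch, assuming Ollama was installed as the `ollama` systemd service (the unit name is an assumption):
```bash
# Pull the last lines of the Ollama service log to see the full
# "llama runner process has terminated" error (unit name assumed).
journalctl -u ollama --no-pager | tail -n 50
```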
-
I'm currently learning this tool, so I'm probably just missing a setting, but I don't know which one. I ran into this problem while trying to set up image generation.
Version: KoboldCpp - Version 1…
-
### What is the issue?
On certain API requests, the server throws a segmentation fault and the API responds with an HTTP 500. So far, I have encountered this twice in thousands of requests. Unfo…
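For an intermittent crash like this, a backtrace is usually more useful than the HTTP 500 alone. A generic sketch of capturing one under gdb (the binary name and flags below are placeholders, not the reporter's actual command):
```bash
# Run the server under gdb; after the segfault, print a full backtrace.
gdb -ex run -ex "bt full" --args ./llama-server -m model.gguf --port 8080
```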
-
When available VRAM runs low, it looks like the Vulkan backend now allocates the compute buffer in shared memory, which causes very significant slowdowns, even if there is actually enough VRAM avai…
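One way to see the two kinds of memory involved is to inspect the heaps the driver reports: a heap flagged device-local is VRAM, while a heap without that flag is the shared system memory the buffer can spill into. An illustrative invocation (vulkaninfo's exact output layout varies by driver and version):
```bash
# List the reported memory heaps; compare the ones carrying
# MEMORY_HEAP_DEVICE_LOCAL_BIT (VRAM) against those that do not (shared).
vulkaninfo | grep -A 6 "memoryHeaps"
```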
-
OS: Ubuntu 22.04.1
Python: 3.12.2
Build fails for llama-cpp-python
```
$ pip install -r requirements.txt
...
Building wheels for collected packages: llama-cpp-python
Building wheel…
```
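When this wheel fails to build, the underlying compiler error is usually hidden in the pip output; rerunning with verbose output, and passing the backend explicitly, tends to surface it. A sketch (the `GGML_CUDA` option follows llama-cpp-python's current README; older releases used `LLAMA_CUBLAS`, so check the version you're pinning):
```bash
# Rebuild llama-cpp-python from source with verbose CMake output so the
# actual compiler/toolchain error is visible (backend flag is an example).
CMAKE_ARGS="-DGGML_CUDA=on" pip install --verbose --no-cache-dir llama-cpp-python
```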
-
So with
```
tabby_x86_64-manylinux2014-cuda122/llama-server -m /home/mte90/.tabby/models/TabbyML/StarCoder2-7B/ggml/model-00001-of-00001.gguf --cont-batching --port 30890 -np 1 --log-disable --ctx-…
```
-
According to https://github.com/ggerganov/llama.cpp/discussions/336#discussioncomment-11184134, there is a new CoreML API, and an ANE backend might be possible to implement with the latest Apple softw…
-
### What happened?
I tried this combination with what I thought was a Vulkan-enabled build; it said I needed to enable BLAS support to get GPU acceleration, but it was actually just a CPU bu…
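A quick way to rule this out is to configure the build with the Vulkan backend explicitly enabled, so a missing Vulkan SDK fails at configure time instead of silently producing a CPU build. A minimal sketch (the `GGML_VULKAN` option name matches current llama.cpp; older trees used `LLAMA_VULKAN`):
```bash
# Configure and build llama.cpp with the Vulkan backend turned on.
cmake -B build -DGGML_VULKAN=ON
cmake --build build --config Release
```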
-
Hello.
I have an Intel Arc A380 and I'm running Ollama with IPEX-LLM on Ubuntu, using this script:
```
#!/bin/bash
# Activate conda environment
source /home/nikos/miniforge3/etc/profile.d/cond…
```
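For reference, IPEX-LLM's Ollama quickstart sets up the environment along these lines; the variable names and values below are assumptions drawn from that guide, not the reporter's script, so verify them against the docs for your IPEX-LLM version:
```bash
# Typical environment for running Ollama on an Intel Arc GPU via IPEX-LLM
# (values are illustrative; see the IPEX-LLM quickstart for your version).
export OLLAMA_NUM_GPU=999        # offload all layers to the GPU
export ZES_ENABLE_SYSMAN=1       # enable SYCL device management
export SYCL_CACHE_PERSISTENT=1   # persist the JIT-compiled kernel cache
source /opt/intel/oneapi/setvars.sh
./ollama serve
```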
-
I compiled this repository on my Debian 12 PC, but it failed with this error:
```
error: ‘ggml_flash_attn’ was not declared in this scope; did you mean ‘ggml_flash_attn_ext’?
  681 | struct …
```
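This usually means the ggml sources being compiled don't match the code calling them: `ggml_flash_attn` was removed upstream in favor of `ggml_flash_attn_ext` (the compiler's suggestion). If the project vendors llama.cpp/ggml as a git submodule, syncing it to the revision the project actually expects is the common fix (a generic sketch, not project-specific instructions):
```bash
# Reset vendored submodules to the commits recorded by this repository,
# so the ggml API matches what the project's code was written against.
git submodule update --init --recursive
```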