-
# Prerequisites
I am running the latest code. Development is very rapid, so there are no tagged versions as of now.
I carefully followed the [README.md](https://github.com/abetlen/llama-cpp-python/b…
-
The version of llama-cpp-python this project uses is quite old, so I get a lot of errors about GGML model versions, and it doesn't support GGUF models.
I would suggest to up the ver…
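The GGML-vs-GGUF mismatch above can be diagnosed before loading: both containers start with a 4-byte magic, and newer llama-cpp-python releases only load GGUF. A minimal sketch (the magic values come from the GGUF and legacy GGML file headers; `detect_model_format` is an illustrative helper, not part of any library):

```python
# Known 4-byte magics as they appear as raw bytes on disk
# (each is a little-endian uint32 in the header). b"GGUF" marks the
# current GGUF container; the others are legacy GGML-era formats that
# newer llama-cpp-python releases no longer load.
MAGICS = {
    b"GGUF": "gguf",
    b"lmgg": "ggml (legacy)",
    b"fmgg": "ggmf (legacy)",
    b"tjgg": "ggjt (legacy)",
}

def detect_model_format(path):
    """Best-effort guess of a model file's container format from its magic."""
    with open(path, "rb") as f:
        magic = f.read(4)
    return MAGICS.get(magic, "unknown")
```

If this reports a legacy format, the file needs to be converted (or re-downloaded) as GGUF before a current llama-cpp-python can load it.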
-
### What is the issue?
I have deployed ollama using the docker image 0.3.10. Loading "big" models fails.
llama3.1 and other "small" models (e.g. codestral) fit into one GPU and work fine. llama3.1…
-
Pulled the latest code, which updated llama.cpp in the talk-llama example.
The build is failing on:
https://github.com/ggerganov/whisper.cpp/blob/master/examples/talk-llama/llama.cpp#L1116
`WHISPER_CUBLAS=…
-
Even though I'm using a GPU build, inference runs on the CPU/RAM. I tried tinkering with parameters, but with no luck.
Log:
```
Godot Engine v4.3.stable.mono.official.77dcf97d8 - https://go…
-
Hi, I'm trying to find out how to use JSON outputs; any example would be appreciated. I'm digging into the code in the meantime!
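One route is llama-cpp-python's OpenAI-style `response_format={"type": "json_object"}`, which can optionally carry a JSON schema to constrain the output (whether the `"schema"` key is honored depends on the installed version, so treat it as an assumption). A sketch that builds such a request body and parses the reply; both helper names are illustrative:

```python
import json

def build_json_request(prompt, schema=None):
    """Build a chat-completion payload that asks for JSON-only output.

    Assumes an OpenAI-compatible endpoint (e.g. llama-cpp-python's server)
    that understands response_format={"type": "json_object"}.
    """
    fmt = {"type": "json_object"}
    if schema is not None:
        # Schema-constrained output; support varies by server version.
        fmt["schema"] = schema
    return {
        "messages": [
            {"role": "system", "content": "You reply only with valid JSON."},
            {"role": "user", "content": prompt},
        ],
        "response_format": fmt,
    }

def parse_json_reply(raw_content):
    """Parse the assistant message content; raises ValueError on bad JSON."""
    return json.loads(raw_content)
```

The same `response_format` argument can be passed directly to `Llama.create_chat_completion` when using the library in-process rather than over HTTP.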
-
Ability to pass custom JSON parameters through to APIs.
Useful for things like:
- Anthropic steering
- Customizing Chapter II ems
- llama.cpp custom sampling parameters
The most gormed way t…
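A passthrough like the one requested above could be sketched as a shallow merge of user-supplied JSON parameters into the outgoing request body, with a guard so the passthrough cannot clobber fields the client must control. The function and the protected-key names are illustrative assumptions, not an existing API:

```python
def merge_extra_params(base_body, extra, protected=("model", "messages")):
    """Shallow-merge user-supplied extra JSON parameters into a request body.

    Keys listed in `protected` cannot be overridden, so a passthrough of
    e.g. llama.cpp sampling parameters cannot replace the model or the
    conversation itself. Returns a new dict; the input is left untouched.
    """
    merged = dict(base_body)
    for key, value in extra.items():
        if key in protected:
            raise ValueError("cannot override protected parameter: %s" % key)
        merged[key] = value
    return merged
```

Usage: `merge_extra_params(body, {"min_p": 0.05, "repeat_penalty": 1.1})` would forward llama.cpp sampling knobs the client does not natively expose.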
-
When I run the Docker container, I see that the GPU is only being used for the embedding model (encoder), not the LLM.
I noticed that llama-cpp-python is not compiled properly (Notice: BLAS=0), as d…
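`BLAS=0` at load time means the wheel was built without GPU acceleration. The usual fix is to force a source rebuild with the GPU backend enabled via `CMAKE_ARGS` (a sketch; the exact flag is version-dependent, and inside Docker this belongs in the image build):

```shell
# Rebuild llama-cpp-python from source with the CUDA backend so the
# startup log reports BLAS=1. Older releases use -DLLAMA_CUBLAS=on,
# newer ones use -DGGML_CUDA=on; pick the one matching your version.
CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install --force-reinstall --no-cache-dir llama-cpp-python
```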
-
### What is the issue?
qwen4b works fine; all models larger than 4B produce gibberish.
```
time=2024-09-05T11:35:49.569+08:00 level=INFO source=download.go:175 msg="downloading 8eeb52dfb3bb in 1…
-
## Overview
## Tasklist
- [ ] Can this be solved via llama.cpp? (e.g. compiled for Vulkan and ROCm)
- [x] https://github.com/janhq/cortex.llamacpp/issues/9
- [ ] [https://github.com/janhq/jan/issues…