-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as…
-
### What happened?
Hi there,
I was trying to build llama.cpp in a project that uses the C++23 standard, and building the `llama` target with MSVC produces a large number of errors. The only fix is to d…
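For context, one workaround in this spirit (the report's actual fix is truncated, so this is not necessarily it) is to build the `llama` target in a separate CMake invocation pinned to llama.cpp's own language standard rather than inheriting C++23 from the parent project. A minimal sketch, assuming llama.cpp builds under C++17 and the checkout lives in a `llama.cpp` subdirectory:

```shell
# Sketch: configure llama.cpp out of tree with its own standard.
# CMAKE_CXX_STANDARD is a stock CMake cache variable.
cmake -S llama.cpp -B build-llama -DCMAKE_CXX_STANDARD=17
cmake --build build-llama --config Release --target llama
```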
-
### What happened?
Following the firmware/driver versions given in the readme, an error occurs during inference.
### Name and Version
Latest version
### What operating system are you seeing the problem on?
_No response_
### Relevant log output
```shell
llama_new_context_w…
-
Hi,
I am experimenting with gpustack, and I noticed that when hosting LLMs on CPU only with llama.cpp as the backend, only one CPU core is utilised when querying such an LLM.
Can multicore processi…
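A point of reference while the question is cut off: llama.cpp itself is multithreaded, and the thread count is exposed as a CLI flag. A minimal sketch, assuming a recent llama.cpp build with the `llama-server` binary and its `-t`/`--threads` flag; the model path and thread count are illustrative:

```shell
# Sketch: pin generation threads to the machine's physical core count
# so inference is not confined to a single core.
./llama-server -m ./models/some-model.gguf --threads 8
```

If gpustack launches the backend with a single thread, the fix might live in its launch configuration rather than in llama.cpp itself.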
-
Getting an assertion error when starting synthetic data generation:
INFO 2024-11-10 08:00:58,565 instructlab.model.backends.llama_cpp:232: Starting server process, press CTRL+C to shutdown serve…
-
**Is your feature request related to a problem? Please describe.**
Currently I am using Qwen2-VL; it is the best VLM for my project. I hope llama-cpp-python can support this model. I tried to …
-
### What is the issue?
When using llm-benchmark with ollama (https://github.com/MinhNgyuen/llm-benchmark), I get around 80 t/s with gemma 2 2b. When asking the same questions to llama.cpp in conve…
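One way to isolate such a comparison: llama.cpp ships a `llama-bench` tool that measures prompt-processing and generation throughput directly, taking the conversation layer out of the picture. A minimal sketch, where the model filename and token counts are illustrative:

```shell
# Sketch: measure raw t/s for a 512-token prompt and 128 generated tokens.
./llama-bench -m ./models/gemma-2-2b-q4_k_m.gguf -p 512 -n 128
```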
-
I downloaded the weights from https://huggingface.co/shuttleai/shuttle-3-diffusion; the program loaded the weights and then exited with no error message.
I debugged the program, and it seems that the problem i…
-
Quote from readme:
> This project, a Dart binding for llama.cpp, is currently on hold as we await the porting of llama.cpp helpers code to C
Is there a link, potentially to the GitHub issue in the…
-
- It should automatically detect the best device to run on.
- We should require zero manual configuration from the user; by default, llama.cpp, for example, requires specifying the device (see the sketch below).
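To illustrate the manual configuration being criticised, here is a hedged sketch of what llama.cpp asks of the user today. The `-ngl`/`--n-gpu-layers` flag is real, but the model path and layer count are illustrative:

```shell
# Sketch: today the user must explicitly offload layers to the GPU;
# omitting -ngl typically leaves the whole model on the CPU.
./llama-cli -m ./models/some-model.gguf -ngl 99
```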