-
About your locally hosted example
```
from paperqa import Settings, ask

local_llm_config = dict(
    model_list=[
        dict(
            model_name="my_llm_model",
            litellm_pa…
```
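The truncated snippet above can be sketched out as a plain-Python config dict. This is a minimal sketch, not the issue author's actual code: the server URL, API key, and the commented-out paperqa call are placeholder assumptions for a local OpenAI-compatible endpoint (e.g. a llama.cpp server).

```python
# LiteLLM-style config for a locally hosted model (sketch; URL/key are
# placeholder assumptions for a local OpenAI-compatible server).
local_llm_config = {
    "model_list": [
        {
            "model_name": "my_llm_model",
            "litellm_params": {
                # The "openai/" prefix tells LiteLLM to use the
                # OpenAI-compatible protocol against a custom base URL.
                "model": "openai/my_llm_model",
                "api_base": "http://localhost:8080/v1",
                "api_key": "sk-no-key-required",
            },
        }
    ]
}

# With paperqa installed, the config would then be passed roughly as in
# the snippet above (assumed usage, shown commented out):
# from paperqa import Settings, ask
# settings = Settings(llm="my_llm_model", llm_config=local_llm_config)
# answer = ask("What is PaperQA?", settings=settings)
```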
-
It would be great to see OLMoE/OlmoeForCausalLM Llama.cpp/GGUF support.
Really neat project!
-
**Describe the package you'd like added**
`llama.cpp` has become a popular inference server for LLMs. Additionally, `llama-cpp-python` is commonly used to connect from Python to `llama.cpp`.
- `l…
-
Could we have support for [llama.cpp](https://github.com/ggerganov/llama.cpp)?
That would make the model accessible to many popular tools such as Ollama, LM Studio, Koboldcpp, text-generation-webui,…
-
## Overview
We need to add support for using [llama.cpp](https://github.com/ggerganov/llama.cpp) as an inference server in our project. llama.cpp is known for its speed, cross-platform compatibility,…
-
Does llama.cpp support `input_embeds`, the way `transformers` supports `input_embeds` in the `model.generate` function?
-
### System Info
Ubuntu 22.04, Python 3.11.8
### Running Xinference with Docker?
- [ ] docker
- [X] pip install
- [ ] installation from …
-
- Quantize your fine-tuned Llama model using [ggml-org/gguf-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo)
- Serve the model using [llama.cpp](https://github.com/ggerganov/llama.cpp)
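The two steps above can be sketched as shell commands. This is a sketch under assumptions: the GGUF filenames, quantization type, and port are placeholders, and the flags follow the `llama-quantize`/`llama-server` binaries shipped with current llama.cpp.

```shell
# 1. Quantize: the gguf-my-repo Space does this in the browser; locally,
#    llama.cpp's quantize tool produces e.g. a Q4_K_M file (placeholder names):
./llama-quantize my-model-f16.gguf my-model-Q4_K_M.gguf Q4_K_M

# 2. Serve the quantized model with llama.cpp's OpenAI-compatible HTTP server
#    (placeholder filename and port):
./llama-server -m my-model-Q4_K_M.gguf --port 8080
```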
-
### Describe the issue as clearly as possible:
When using `models.llamacpp` to generate JSON with a Pydantic model, I get an error when generating the first result (see the code to reproduce below). I h…
-
```
Traceback (most recent call last):
  File "/data/zhy/models/llama_cpp_python/model_test.py", line 1, in <module>
    from llama_cpp import Llama
  File "/data/zhy/models/llama_cpp_python/llama_cpp/__init__…
```