-
## Overview
## Tasklist
- [ ] Can this be solved via llama.cpp? (e.g. compiled for Vulkan and ROCm)
- [x] https://github.com/janhq/cortex.llamacpp/issues/9
- [ ] [https://github.com/janhq/jan/issues…
-
### 🚀 The feature, motivation and pitch
Currently, when using Automatic Prefix Caching, you may truncate the input (for chat-related generation) because of the context limit. The Automatic Prefix …
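To make the failure mode concrete, here is a toy sketch (not vLLM internals; the cache, its key, and the block size are invented for illustration) of why truncating the *front* of the prompt defeats a prefix cache:

```python
# Toy prefix cache keyed on the leading tokens of the prompt.
# The dict cache and 4-token block size are illustrative only.
cache: dict[tuple[int, ...], bool] = {}

def cached_prefill(tokens: list[int]) -> bool:
    """Return True if the leading block of `tokens` was seen before."""
    key = tuple(tokens[:4])
    hit = key in cache
    cache[key] = True
    return hit

prompt = [1, 2, 3, 4, 5, 6, 7, 8]
cached_prefill(prompt)             # miss: first request populates the cache
print(cached_prefill(prompt))      # True: an identical prefix is reused

truncated = prompt[2:]             # context-limit truncation drops the front
print(cached_prefill(truncated))   # False: leading tokens changed, no reuse
```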
-
**The bug**
Following the examples here (https://lightning.ai/lightning-ai/studios/structured-llm-output-and-function-calling-with-guidance#llm-tool-use), using tools with llama.cpp and a Mistral 8B GGUF model,…
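For reference, the repro follows the linked studio's pattern of loading a local GGUF through `models.LlamaCpp`; a minimal sketch (the model path is a placeholder):

```python
from guidance import models, gen

# Placeholder path to the Mistral GGUF file used in the linked examples.
lm = models.LlamaCpp("mistral-8b-instruct.gguf")

# Plain generation as a sanity check; the reported failure involves tool use.
lm += "Q: What is 2 + 2?\nA: " + gen("answer", max_tokens=16)
print(lm["answer"])
```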
-
Kinda self-explanatory from the title: right now, each Python version for a given target builds llama.cpp independently. This artificially limits how many platforms we can support by blowing up CI buil…
-
## Describe the bug
### My environment
Windows 11 Pro, Docker Desktop, WSL2 Ubuntu Engine, latest nvidia driver
### CUDA test
I made sure the Docker WSL2 CUDA implementation works correctly by…
-
What about custom/private LLMs? Will there be an option to use some of LangChain's local features, like llama.cpp?
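For context, this is roughly what the LangChain side looks like with a local llama.cpp model (a sketch; the model path is a placeholder):

```python
from langchain_community.llms import LlamaCpp

# Placeholder path to a local GGUF model file.
llm = LlamaCpp(model_path="models/mistral-7b.Q4_K_M.gguf", n_ctx=2048)
print(llm.invoke("Say hello in one short sentence."))
```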
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
Hello, is it possible to compile with CMake?
With make, it doesn't detect CUDA.
-
**The bug**
When using `models.LlamaCpp`, the selected tokenizer is always gpt2 (this can be seen in the output when the `verbose=True` arg is set). I have pasted the dumped KV metadata keys below:
```
llama_mod…
```
-
I installed llama-cpp-python using the command below:
`CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python`
The speed:
`llama_print_timings: eval time = 81.91 ms / 2 runs ( 40…`
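In case it helps anyone comparing eval times with a cuBLAS build: GPU offload also has to be requested at load time, which is easy to miss. A minimal sketch (the model path is a placeholder, and `n_gpu_layers` being relevant here is only a guess):

```python
from llama_cpp import Llama

# n_gpu_layers=-1 asks llama.cpp to offload all layers to the GPU;
# verbose=True prints the llama_print_timings lines quoted above.
llm = Llama(model_path="models/model.gguf", n_gpu_layers=-1, verbose=True)

out = llm("Q: Name the planets in the solar system. A: ", max_tokens=32)
print(out["choices"][0]["text"])
```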