-
llama.cpp running in server mode: how do I use this? Is there any documentation on usage?
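A minimal sketch of the usual workflow (the model path, port, and context size below are placeholders, not from the question): `llama-server` serves an HTTP API, including a `/completion` endpoint you can query with curl.

```bash
# Start the server with a GGUF model (paths and port are illustrative).
./llama-server -m ./models/model.gguf --port 8080 -c 4096

# From another shell: request a completion from the built-in HTTP API.
curl http://localhost:8080/completion \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Write a haiku about GPUs.", "n_predict": 64}'
```

The server also exposes OpenAI-compatible routes such as `/v1/chat/completions`, which is often the easiest way to point existing clients at it.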
-
So with
```
tabby_x86_64-manylinux2014-cuda122/llama-server -m /home/mte90/.tabby/models/TabbyML/StarCoder2-7B/ggml/model-00001-of-00001.gguf --cont-batching --port 30890 -np 1 --log-disable --ctx-…
```
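A quick, hedged way to confirm a server started this way is responding (the port is taken from the command above; `/health` is llama-server's built-in readiness endpoint):

```bash
# Probe the llama-server instance started above (port 30890 from the command).
curl http://localhost:30890/health
```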
-
When I quantized the Qwen2.5-1.5B-Instruct model following "GGUF Export" in the examples.md in the docs, it reported that quantization was complete and I obtained the GGUF model. But when I load …
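For reference, a hedged sketch of the usual llama.cpp export-and-quantize flow (file names and the Q4_K_M type are illustrative, not taken from the post):

```bash
# Convert the HF checkpoint to GGUF, then quantize it with llama-quantize.
python convert_hf_to_gguf.py ./Qwen2.5-1.5B-Instruct \
  --outfile qwen2.5-1.5b-instruct-f16.gguf
./llama-quantize qwen2.5-1.5b-instruct-f16.gguf \
  qwen2.5-1.5b-instruct-q4_k_m.gguf Q4_K_M
```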
-
### What happened?
llama.cpp is running slowly on an NVIDIA A100 80GB GPU.
Steps to reproduce:
1. git clone https://github.com/ggerganov/llama.cpp && cd llama.cpp
2. mkdir build && cd build
3. cmak…
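A hedged sketch of how such a build usually continues (the flags below are assumptions for a CUDA build, not taken from the report); missing GPU offload is a common cause of "slow on GPU" reports:

```bash
# Configure with the CUDA backend and build (run inside the build directory).
cmake .. -DGGML_CUDA=ON
cmake --build . --config Release -j

# Offload all layers to the GPU; without -ngl, inference stays on the CPU.
./bin/llama-cli -m ../models/model.gguf -ngl 99 -p "Hello"
```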
-
```swift
.package(url: "https://github.com/ggerganov/llama.cpp.git", branch: "master")
```
It is better to pin a specific version of the package (for example with SwiftPM's `.package(url:exact:)` or `.package(url:revision:)`), because master is constantly updated and different r…
-
Need to experiment with these solutions and decide whether LoRA can be implemented on the CPU (see the sketch after this list):
- [x] unsloth
- [x] axolotl
- [x] llama factory
- [x] text-gen-ui *
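Tangential to the training tools above, one hedged data point: llama.cpp can already *apply* a LoRA adapter entirely on the CPU at inference time. This is applying, not training, a LoRA, and the file names below are placeholders:

```bash
# Run a base GGUF model with a LoRA adapter, CPU only
# (-ngl 0 keeps all layers off the GPU). Paths are illustrative.
./llama-cli -m base-model.gguf --lora lora-adapter.gguf -ngl 0 -p "Hello"
```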
-
### Request Description
Llama.cpp is a very popular and excellent LLM/VLM inference and deployment framework, implemented in pure C/C++ with no dependencies, and cross-platform. Based on SYCL and Vu…
-
When I quantized the Qwen2.5-1.5B-Instruct model following **"Quantizing the GGUF with AWQ Scale"** in the [docs](https://qwen.readthedocs.io/en/latest/quantization/llama.cpp.html), it showed that th…
-
### What happened?
I ran `./llama-gbnf-validator mygrammar.txt mytestprogram.txt` and, after checking the grammar itself, it started to parse the test file and went into an infinite loop calling st…
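A hedged scaffold for reproducing this kind of report (the file names come from the command above, but the grammar body and test input are an illustrative minimal GBNF example, not the reporter's files):

```bash
# Write a minimal GBNF grammar and a matching test input, then validate.
cat > mygrammar.txt <<'EOF'
root ::= "hello" " " "world"
EOF
printf 'hello world' > mytestprogram.txt
./llama-gbnf-validator mygrammar.txt mytestprogram.txt
```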
-
### Reminder
- [X] I have read the README and searched the existing issues.
### Reproduction
Hi there, I am observing a difference in output between LLaMA-Factory inference and llama.cpp.
I am…