-
How do I use llama.cpp in server mode? Is there any documentation on usage?
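For reference, a minimal sketch of talking to the llama.cpp HTTP server, assuming it was started beforehand with something like `./server -m models/llama-2-7b.Q4_0.gguf -c 2048 --port 8080` (the binary is called `llama-server` in newer builds; the model path is a placeholder). The `/completion` endpoint accepts a JSON body with `prompt` and `n_predict`:

```python
import json
import urllib.request

# Assumes a llama.cpp server is already running locally, e.g.:
#   ./server -m models/llama-2-7b.Q4_0.gguf -c 2048 --port 8080
URL = "http://127.0.0.1:8080/completion"

payload = {
    "prompt": "Building a website can be done in 10 simple steps:",
    "n_predict": 64,      # number of tokens to generate
    "temperature": 0.7,
}

req = urllib.request.Request(
    URL,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

with urllib.request.urlopen(req) as resp:
    result = json.loads(resp.read())

# The generated text comes back under "content"
print(result["content"])
```

Recent server builds also expose an OpenAI-compatible `/v1/chat/completions` route, which can be used in the same way with a chat-style JSON body.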
-
I followed the documentation to run the llama2-7b model (4-bit quantized) and also ran it on llama.cpp for comparison. I noticed that, except for nt=1, where there was a slight performance improvement…
-
I found that the benchmark suite outputs the time to first token. However, when I run `python benchmark.py --model meta-llama/Llama-2-7b-hf static --isl 128 --osl 128 --batch 1` an error occurs:…
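As a point of reference for what "time to first token" measures, here is a minimal sketch (not the benchmark suite itself) that times the first streamed token with llama-cpp-python; the model path and parameters are placeholders:

```python
import time
from llama_cpp import Llama

# Placeholder path to a local quantized model
llm = Llama(model_path="./llama-2-7b.Q4_0.gguf", n_ctx=2048)

prompt = "The quick brown fox"
start = time.perf_counter()

first_token_time = None
pieces = []
# stream=True yields one chunk per generated token
for chunk in llm(prompt, max_tokens=128, stream=True):
    if first_token_time is None:
        first_token_time = time.perf_counter() - start
    pieces.append(chunk["choices"][0]["text"])

total_time = time.perf_counter() - start
print(f"time to first token: {first_token_time:.3f} s")
print(f"total generation time: {total_time:.3f} s")
```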
-
While building a mixed C and Python wheel, I got the following error:
```sh
~ $ pip install -U llama_cpp_python
Requirement already satisfied: llama_cpp_python in c:\users\ךינשגכהד\scoop\apps\pytho…
-
In a multi-turn conversation I see that the combination of llama-cpp-python and llama-cpp-agent is much slower on the second prompt than the Python bindings of gpt4all. See the two screenshots below. Th…
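One thing worth checking here is prompt/KV caching. As a hedged sketch (assuming a recent llama-cpp-python; class and method names may differ between versions, and the model path is a placeholder), the library exposes a cache that can be attached to the `Llama` instance so a follow-up prompt that shares a prefix with the previous one does not have to be re-evaluated from scratch:

```python
from llama_cpp import Llama, LlamaCache

# Placeholder model path; parameters are illustrative only
llm = Llama(model_path="./mistral-7b-instruct.Q4_0.gguf", n_ctx=4096)

# Attach an in-memory cache so repeated prompt prefixes can be reused
llm.set_cache(LlamaCache())

history = "USER: Hello, who are you?\nASSISTANT:"
first = llm(history, max_tokens=64)
history += first["choices"][0]["text"]

# Second turn: the prompt starts with the same prefix as the first turn,
# so the cached state can be reused instead of re-processing everything
history += "\nUSER: What did I just ask you?\nASSISTANT:"
second = llm(history, max_tokens=64)
print(second["choices"][0]["text"])
```

Without such a cache, every turn re-processes the entire growing conversation, which would explain a second prompt being noticeably slower than the first.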
-
GPU: 2× V100
build script:
```shell
python build.py --model_dir /data/models/vicuna-13b-v1.5/vicuna-13b-v1.5/ \
--dtype float16 \
--use_gpt_attention_plugin float1…
-
### Background / context
- I was originally following the install instructions for Mac at https://simonwillison.net/2023/Aug/1/llama-2-mac/ - yeah, I should have spotted that this was an older post....bu…
-
### System Info
- CPU Arch x86
- 4 H100 GPUs
- using commit 6cc5e177ff2fb60b1aab3b03fa0534b5181cf0f1
### Who can help?
@kaiyux @byshiue
### Information
- [ ] The official example scripts
- [X…
-
I run llama-cpp-python on my new PC, which has a built-in RTX 3060 with 12 GB of VRAM.
This is my code:
```
from llama_cpp import Llama
llm = Llama(model_path="./wizard-mega-13B.ggmlv3.q4_0.bin", n_ctx=…
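# (Addition, not part of the original snippet.) A hedged sketch: to actually use
# the RTX 3060, llama-cpp-python has to be installed with CUDA/cuBLAS support and
# the constructor needs n_gpu_layers set; otherwise the model runs entirely on the
# CPU regardless of the installed GPU. Values below are illustrative only:
#
#   llm = Llama(
#       model_path="./wizard-mega-13B.ggmlv3.q4_0.bin",
#       n_ctx=2048,        # illustrative context size
#       n_gpu_layers=32,   # number of layers to offload to the GPU; 0 keeps everything on CPU
#   )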