-
To apply a grammar to a chat completion, it looks like the llamafile server expects a `grammar` argument: https://github.com/Mozilla-Ocho/llamafile/blob/main/llama.cpp/server/server.cpp#L2551
```
…
```
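For reference, a minimal sketch of sending a GBNF grammar as a top-level field in the request body, per the linked server.cpp; the host, port, and toy grammar here are assumptions, not from the original post:
```python
import requests

# Toy GBNF grammar that restricts the reply to "yes" or "no" (hypothetical).
GRAMMAR = 'root ::= "yes" | "no"'

resp = requests.post(
    "http://localhost:8080/v1/chat/completions",  # assumed local llamafile server
    json={
        "messages": [{"role": "user", "content": "Is the sky blue?"}],
        "grammar": GRAMMAR,  # top-level field, per the linked server.cpp
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```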
-
I installed llama-cpp-python using the command below:
CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python
The speed:
llama_print_timings: eval time = 81.91 ms / 2 runs ( 40…
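If it helps, a minimal sketch of confirming the CUBLAS build actually offloads layers (the model path is hypothetical); `verbose=True` makes llama-cpp-python print the `llama_print_timings` lines quoted above along with the load log:
```python
from llama_cpp import Llama

# Offload all layers to the GPU; verbose=True prints load info and timings.
llm = Llama(model_path="./model.gguf", n_gpu_layers=-1, verbose=True)
out = llm("Q: What is 2+2? A:", max_tokens=8)
print(out["choices"][0]["text"])
```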
-
Hello,
I'm using the following script to fine-tune the llama3 model with a custom dataset of questions and responses in the `{"prompt": "", "completion": ""}` format defined [here](https://github.com/…
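For context, a minimal sketch of that record format written out as JSONL; the example rows are hypothetical:
```python
import json

rows = [
    {"prompt": "What is the capital of France?", "completion": "Paris."},
    {"prompt": "Name a prime number.", "completion": "7."},
]
with open("train.jsonl", "w") as f:
    for row in rows:
        f.write(json.dumps(row) + "\n")  # one {"prompt", "completion"} object per line
```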
-
I've noticed that GPU utilization is very low during model inference, peaking at only 80%, and I want to raise it to 99%. How can I adjust the parameters?
GPU Name …
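For reference, a minimal sketch of the parameters that usually affect GPU utilization in llama-cpp-python; the model path and values are assumptions, not from the original post:
```python
from llama_cpp import Llama

llm = Llama(
    model_path="./model.gguf",
    n_gpu_layers=-1,  # offload every layer to the GPU
    n_batch=1024,     # a larger prompt-processing batch keeps the GPU busier
    n_ctx=4096,
)
```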
-
Hello, I am a complete noob, so I don't know if I have provided enough information to get help, but I need help with this, please.
# Prerequisites
Please answer the following questions for yourself …
-
Use a GGUF of https://huggingface.co/CohereForAI/aya-23-8B
from https://huggingface.co/bartowski/aya-23-8B-GGUF
```python
import llama_cpp
llm = llama_cpp.Llama.from_pretrained(
    repo_id="bart…
```
-
Installed llama_cpp_python-0.2.43.tar.gz via
CMAKE_ARGS="-DLLAMA_CUBLAS=ON -DCMAKE_CUDA_COMPILER=/opt/cuda/bin/nvcc -DTCNN_CUDA_ARCHITECTURES=61" pip install llama-cpp-python
llm = Llama(model_pat…
-
I installed `llama-cpp-python` on a system with:
**CPU AMD EPYC 7542**
**GPU V100**
But it raised the exception shown in the attached screenshot.
-
### What is the issue?
```
(.venv) [root@bastion ollama]# python llm/llama.cpp/convert-hf-to-gguf.py ./model --outtype f16 --outfile converted.bin
INFO:hf-to-gguf:Loading model: model
INFO:gguf.gguf_…
```
-
Modifying two files is all you need.
**pyproject.toml**: change `llama-cpp-python = "^0.2.11"` to `llama-cpp-python = "^0.2.23"`.
**poetry.lock**: search for `llama-cpp-python` and update two values:
`version = "0.2.23"`
…
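Alternatively, after bumping the constraint in pyproject.toml, letting Poetry regenerate the lock entries should give the same result:
poetry update llama-cpp-python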