-
I'm using Docker's latest-aio-gpu-nvidia-cuda-12 image with multiple GPUs.
I would like to adjust llama-cpp's settings in detail; which file should I change?
I am modifying aio/gpu-8g/t…
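For context, LocalAI's AIO images read per-model YAML definitions (the aio/gpu-8g directory mentioned above contains such files). A minimal sketch of the kind of fields one of those files carries; the field names and values here are assumptions to verify against the files in your image, not a definitive schema:

```yaml
# Hypothetical LocalAI model definition; exact fields vary by version.
name: gpt-4
context_size: 8192
f16: true
parameters:
  model: some-model.gguf   # illustrative file name
```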
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…
-
Hi! I have followed every step in [Run Llama 2 on your own Mac using LLM and Homebrew](https://simonwillison.net/2023/Aug/1/llama-2-mac/), in particular:
```
pipx install llm # python 3.11
llm in…
```
-
### Motivation
QuaRot (https://arxiv.org/abs/2404.00456) has been out for three weeks, and the preliminary results are convincing. Also see the discussions in `llama.cpp` with the QuaRot authors. It would be amazing to …
-
`python convert-lora-to-ggml.py my-model` won't work because there is no `convert-lora-to-ggml.py` file in the llama.cpp folder anymore.
-
I have a local server running an OpenAI compatible API. I simply want all requests that normally go to `api.openai.com:443` go to `localhost:8000`.
I did see that you should be able to [override mo…
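One way to express the redirect described above is to rewrite any `api.openai.com` URL to point at the local server before sending the request. The helper below is a hypothetical stdlib-only sketch (the `localhost:8000` target comes from the issue; the function name is mine):

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical helper: rewrite api.openai.com URLs to target a local
# OpenAI-compatible server, keeping the original path and query string.
def redirect_to_local(url: str, local_netloc: str = "localhost:8000") -> str:
    parts = urlsplit(url)
    if parts.hostname == "api.openai.com":
        # Swap scheme and host; the local server typically speaks plain HTTP.
        parts = parts._replace(scheme="http", netloc=local_netloc)
    return urlunsplit(parts)

print(redirect_to_local("https://api.openai.com:443/v1/chat/completions"))
# http://localhost:8000/v1/chat/completions
```

With the official `openai` Python client (v1+), the same effect can usually be had without URL rewriting by passing a `base_url` when constructing the client, or by setting the `OPENAI_BASE_URL` environment variable.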
-
I run this on two A30 GPUs with CUDA driver 535.104.12.
The Docker image is built with `make -C docker release_build CUDA_ARCHS="80-real"`.
I'm using the latest code on the main branch.
```
commit 89ba1…
```
-
Pulled the latest code with the updated llama.cpp in the talk-llama example.
The build is failing at:
https://github.com/ggerganov/whisper.cpp/blob/master/examples/talk-llama/llama.cpp#L1116
`WHISPER_CUBLAS=…
-
**Is your feature request related to a problem? Please describe.**
I am building the prompt myself and calling
```
llm.create_completion(prompt, max_tokens=max_tokens,
                      …
```
-
## Goal
- Llama 3.1 should support tool use in llama.cpp
- https://github.com/janhq/models/issues/16
## Original post
**Problem**
AFAICS, the current implementation does not have OpenAI Function Cal…
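To make the goal concrete, this is the shape of the OpenAI-style request body that tool use implies the server would need to accept. The model name and the `get_weather` tool below are purely illustrative, not anything llama.cpp currently defines:

```python
import json

# Sketch of an OpenAI-style function-calling request body.
# The model name and the get_weather tool are illustrative assumptions.
request_body = {
    "model": "llama-3.1-8b-instruct",
    "messages": [
        {"role": "user", "content": "What's the weather in Paris?"},
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Look up the current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
}

print(json.dumps(request_body, indent=2))
```

Supporting tool use would mean parsing this `tools` array, injecting the declarations into the Llama 3.1 chat template, and returning `tool_calls` in the response when the model emits one.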