-
### What is the issue?
My card is a W7900 and my ROCm driver is 6.3. I found that the llama.cpp server started by Ollama always runs without the `-fa` flag.
I checked the code and found:
…
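
A minimal sketch of a possible workaround, assuming llama.cpp's `llama-server` binary is on PATH (the model path below is a placeholder); recent Ollama versions also read an `OLLAMA_FLASH_ATTENTION=1` environment variable, which may be the intended switch:

```python
import os
import subprocess

# A minimal sketch, not Ollama's launch code: start llama.cpp's llama-server
# directly with flash attention turned on. The model path is a placeholder.
model = os.path.expanduser("~/models/my-model.gguf")  # hypothetical path

subprocess.Popen([
    "llama-server",
    "-m", model,
    "--port", "8080",
    "-fa",  # --flash-attn; the flag Ollama does not appear to pass through
])
```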
-
Hi Team,
I am already using LMStudio and Ollama for model deployments. Given this model is llama.cpp-compatible and uses it, how can it be deployed, hosted, and used with LMStudio or Ollama? It …
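
A minimal sketch of one common pattern, assuming the GGUF build of the model is already loaded in LM Studio with its local server running on the default port 1234 (the model name below is a placeholder):

```python
from openai import OpenAI

# LM Studio exposes an OpenAI-compatible server; the API key is not checked.
client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

resp = client.chat.completions.create(
    model="local-model",  # hypothetical id; LM Studio shows the real one
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)
```

Ollama serves a similar OpenAI-compatible endpoint on port 11434 once the GGUF has been registered with `ollama create`.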
-
# Expected Behavior
I tried to install llama via Poetry and it didn't work.
# Current Behavior
It just printed some information that I don't understand; I tried checking and asked for help, and it …
-
## Overview
- Intel's Lunar Lake, which combines a CPU, NPU, and iGPU on a single chip, is releasing soon
## Tasklist
- [x] https://github.com/janhq/cortex.cpp/issues/677
- [x] https://github.com/janhq/cort…
-
Hi, I wanted to try the code-7b model, but I got this error:
```
llama-gpt-llama-gpt-ui-1 | [INFO wait] Host [llama-gpt-api:8000] not yet available...
llama-gpt-llama-gpt-api-1 | /usr/local/…
```
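
For context, that `[INFO wait]` line is the UI container polling the API container until it accepts connections. A minimal sketch of that kind of readiness check, with the host and port taken from the log above:

```python
import time
import urllib.error
import urllib.request

URL = "http://llama-gpt-api:8000/"  # host/port from the log above

while True:
    try:
        urllib.request.urlopen(URL, timeout=2)
        break  # got a response: the API is up
    except urllib.error.HTTPError:
        break  # any HTTP status still means the server is listening
    except OSError:
        print("not yet available, retrying...")
        time.sleep(5)

print("API is up")
```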
-
Hi, do you think I can just drop newer versions of llama.cpp, ggml.h, ggml.c, etc. into src to keep this up to date with llama.cpp? Or is there more to it?
-
This reports mistral.rs as being faster than llama.cpp: https://github.com/EricLBuehler/mistral.rs/discussions/612
But I'm seeing much slower speeds for the same prompt/settings.
Mistral.rs
``…
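
To keep the comparison apples-to-apples, a minimal sketch of timing the llama.cpp side with llama-cpp-python (the model path is a placeholder); mistral.rs would be timed the same way on an identical prompt and sampling settings:

```python
import time

from llama_cpp import Llama

llm = Llama(model_path="model.gguf", n_ctx=4096, verbose=False)  # placeholder path

prompt = "Explain the borrow checker in one paragraph."
start = time.perf_counter()
out = llm(prompt, max_tokens=256, temperature=0.0)
elapsed = time.perf_counter() - start

n_tokens = out["usage"]["completion_tokens"]
print(f"{n_tokens} tokens in {elapsed:.2f}s -> {n_tokens / elapsed:.1f} tok/s")
```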
-
**Is your feature request related to a problem? Please describe.**
When editing the beginning of a long file, prompt evaluation takes a lot of time.
The reason for that is explained in `Additional context`.
Curr…
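
A minimal sketch of why this is slow, with made-up token ids: the KV cache is only valid up to the first changed token, so an edit near the start of the file forces everything after the longest common prefix to be re-evaluated:

```python
def common_prefix_len(a: list[int], b: list[int]) -> int:
    # Number of leading tokens shared by two tokenizations.
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

old_tokens = [1, 42, 7, 7, 9, 13, 99]  # previous prompt
new_tokens = [1, 42, 8, 7, 9, 13, 99]  # one token near the start changed

keep = common_prefix_len(old_tokens, new_tokens)
print(f"{keep} cached tokens reusable; {len(new_tokens) - keep} re-evaluated")
```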
-
I'm attempting to install llama-cpp-python under the tensorflow-gpu Docker image (nightly build). When I attempt to do so, I get the following error messages.
````
root@a1f1e127514b:/tf# CMAKE_A…
````
-
Hi, thanks for the lib! I want to use some embedding models (the arch is BERT) from the HF hub. I have tried GGUF, but the converter says the bert arch cannot be converted to it. I have also tried directly have …
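
A minimal sketch of a fallback that skips GGUF entirely, running the BERT-arch embedding model straight from the HF hub with sentence-transformers (the model id is just an example); newer llama.cpp converters have since gained BERT support, so updating the converter may also be worth trying:

```python
from sentence_transformers import SentenceTransformer

# Example model id; substitute the BERT-arch embedding model you actually use.
model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")
embeddings = model.encode(["first sentence", "second sentence"])
print(embeddings.shape)  # (2, 384) for this particular model
```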