-
Hi, I've been following this project's model testing for a while. On my side I'm running llama.cpp; since we have company servers, it runs CPU-only at about 10 t/s, which is plenty for internal use.
Models used for the knowledge base:
Qwen2.5-14B: q4
bge-reranker-base
Dmeta-embedding-zh-small
Those are the models I'm using. Could this be switched over to run on CPU as well? Thanks a lot.
# start in the background noh…
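For reference, a minimal CPU-only sketch using the llama-cpp-python bindings; the model path, thread count, and prompt below are illustrative placeholders, not values from the original report:
```
# Minimal CPU-only inference sketch with llama-cpp-python.
# Assumes a local GGUF quantization of Qwen2.5-14B; the path is hypothetical.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/qwen2.5-14b-instruct-q4_k_m.gguf",  # hypothetical path
    n_gpu_layers=0,   # offload nothing: pure CPU inference
    n_threads=16,     # tune to the server's physical core count
    n_ctx=4096,       # context window
)

out = llm("Answer from the knowledge base: ...", max_tokens=128)
print(out["choices"][0]["text"])
```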
-
### Describe the bug
Hi!
I tried to run an LLM locally using `openllm`, and `phi3:3.8b-ggml-q4` happens to be the only model that I am able to run locally according to openllm, so I ran `openl…
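The command above is cut off; for context, a minimal sketch of querying a model served locally by OpenLLM through its OpenAI-compatible API. The port, API key, and model id are assumptions, not values from the original issue:
```
# Sketch: query a locally running OpenLLM server via its OpenAI-compatible
# endpoint. Assumes the server is already serving; host/port and model id
# below are assumptions, not values from the original report.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:3000/v1", api_key="na")

resp = client.chat.completions.create(
    model="phi3:3.8b-ggml-q4",  # model id as reported by openllm (assumed)
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)
```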
-
There seems to be no configuration for `.env.local` that I can get to work to connect to a Llama3 inference endpoint hosted by HuggingFace cloud (and I can find no examples).
```
MONGODB_URL=mong…
```
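For what it's worth, the shape chat-ui documents for a remote endpoint looks roughly like the sketch below; the model name, endpoint URL, and token are placeholders, and the exact fields should be checked against the repo's README for the version in use:
```
MODELS=`[
  {
    "name": "meta-llama/Meta-Llama-3-8B-Instruct",
    "endpoints": [
      {
        "type": "tgi",
        "url": "https://<your-endpoint>.endpoints.huggingface.cloud",
        "authorization": "Bearer hf_<token>"
      }
    ]
  }
]`
```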
-
I went through the code and it downloads a model from GPT4ALL.
How can I add my .gguf file to the Android project and use it instead? I won't be able to share it on the App Store, but that's OK.
H…
-
### Describe the issue as clearly as possible:
I run `examples/llamacpp_example.py`
```
outlines/models/llamacpp.py:180: FutureWarning: The input object of type 'Tensor' is an array-like implem…
```
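For anyone reproducing this, a minimal sketch of the pattern the example exercises, i.e. outlines driving a llama.cpp model; the repo id and GGUF filename are illustrative placeholders, not the ones from the example:
```
# Sketch: constrained generation with outlines on a llama.cpp backend.
# Repo id and GGUF filename are hypothetical placeholders.
from outlines import models, generate

model = models.llamacpp(
    "Qwen/Qwen2.5-0.5B-Instruct-GGUF",    # hypothetical repo id
    "qwen2.5-0.5b-instruct-q4_k_m.gguf",  # hypothetical filename
)

# Restrict the model's output to one of two labels.
generator = generate.choice(model, ["Positive", "Negative"])
print(generator("The movie was great. Sentiment:"))
```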
-
### Environment
🐧 Linux
### System
Mozilla/5.0 Linux x86_64 Firefox/131.0
### Version
staging (last version of this repo)
### Desktop Information
Node JS Version
Node.js v18.20.4.
API
oobabo…
-
Everyone's gotta have an LLM-powered search engine feature, right?
https://github.com/developersdigest/llm-answer-engine
-
I am getting this error:
```
llama.cpp: loading model from /Documents/Proj/delta/llama-2-7b-chat/ggml-model-q5_1.bin
error loading model: unrecognized tensor type 14
llama_init_from_file: failed…
```
-
Outlines currently supports the vLLM inference engine; it would be great if it could also support the TensorRT-LLM inference engine.
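For context, the existing vLLM path looks roughly like the sketch below (the model name and pattern are illustrative placeholders); a TensorRT-LLM backend would presumably slot in the same way:
```
# Sketch of the existing vLLM integration that a TensorRT-LLM backend
# would parallel; the model name is an illustrative placeholder.
from outlines import models, generate

model = models.vllm("mistralai/Mistral-7B-v0.1")  # hypothetical model choice

# Constrained generation is expressed the same way regardless of backend.
generator = generate.regex(model, r"[0-9]{4}-[0-9]{2}-[0-9]{2}")
print(generator("The ISO date today is: "))
```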
-
Adding high-res support would make images way more detailed, although it's somewhat difficult since Cascade is already hard to run with its two models. Also, maybe incorporate an option for an LLM like west …