-
see #27
https://ai.google.dev/gemma/docs?hl=en
https://www.kaggle.com/models/google/gemma
Gemma on Vertex AI Model Garden
https://console.cloud.google.com/vertex-ai/publishers/google/model-gard…
-
**LocalAI version:**
```
v1.25.0-cublas-cuda12-ffmpeg
```
**Environment, CPU architecture, OS, and Version:**
```
# uname -a
Linux localai-ix-chart-f8bbbb7c7-x6xx9 6.1.42-production+truen…
```
-
Trying to load some more recent Q5_K_M models using talk-llama and getting errors about tensor type 13. @ggerganov can you please update talk-llama to work with the latest llama.cpp?
Thank you!
-
Hi! I have followed every step in [Run Llama 2 on your own Mac using LLM and Homebrew](https://simonwillison.net/2023/Aug/1/llama-2-mac/), in particular:
```
pipx install llm # python 3.11
llm in…
```
-
### Is there an existing issue / discussion for this?
- [x] I have searched the existing issues / discussions
### Is this question already answered in the FAQ? …
Lyzin updated 2 months ago
-
Not sure if this RNN counts as an LLM, but if so it would be nice to have it; let me know what needs to be done for packaging.
https://www.rwkv.com/
-
Hi all.
I just got a Microsoft Laptop 7, an AI PC with a Snapdragon X Elite, an NPU, and an Adreno GPU. It is an ARM-based system.
But I found that the NPU is not used when running Ollama.
Would it be suppo…
-
### What happened?
cmd: docker run --rm -it --gpus all ghcr.nju.edu.cn/ggerganov/llama.cpp:full-cuda --version
output:
```
docker: Error response from daemon: failed to create task for container:…
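```

A "failed to create task for container" error with `--gpus all` is often (though not always) a sign that the NVIDIA Container Toolkit is missing or not registered with the Docker daemon. A quick way to check, independent of the llama.cpp image, is a GPU smoke test against a stock CUDA base image. This is a diagnostic sketch, not a confirmed fix for this report; the exact CUDA image tag is an assumption, and the commands need a host with Docker and an NVIDIA GPU:

```shell
# Does the Docker daemon know about the nvidia runtime at all?
docker info | grep -i nvidia

# Minimal GPU smoke test: if this also fails, the problem is the host's
# container/GPU setup, not the llama.cpp image itself.
docker run --rm --gpus all nvidia/cuda:12.3.2-base-ubuntu22.04 nvidia-smi
```

If the smoke test fails too, installing `nvidia-container-toolkit` and restarting the Docker daemon is the usual remedy.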
-
Hey, thank you so much for the great model and this repo!
Would you be willing to add support for this chat format to llama-cpp-python, so that we can use function calling (and JSON mode) with thei…
-
I’ve discovered a performance gap between the Neural Speed Matmul operator and the Llama.cpp operator in the Neural-Speed repository. This issue was identified while running a benchmark with the ONNXR…