-
### What happened?
Chat template formatting seems to be swapped between Mistral and Llama 2.
Llama 2 supports the `` token for system messages, while Mistral simply uses newlines.
Starting llama ser…
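To make the difference concrete, here is a minimal sketch of the two prompt formats as they are commonly documented: Llama-2-chat wraps the system message in `<<SYS>>` tags inside the first `[INST]` block, while Mistral-Instruct has no dedicated system token, so the system text is typically just prepended to the first user turn with newlines. The helper names below are illustrative, not part of any library API.

```python
# Illustrative sketch of the two commonly documented prompt formats.
# These helpers are hypothetical; they only show the shape of each template.

def llama2_prompt(system: str, user: str) -> str:
    # Llama-2-chat: system message wrapped in <<SYS>> tags inside [INST].
    return f"[INST] <<SYS>>\n{system}\n<</SYS>>\n\n{user} [/INST]"

def mistral_prompt(system: str, user: str) -> str:
    # Mistral-Instruct: no system token; system text joined by newlines.
    return f"[INST] {system}\n\n{user} [/INST]"

print(llama2_prompt("Be brief.", "Hi"))
print(mistral_prompt("Be brief.", "Hi"))
```

If the server applies the Mistral-style template to a Llama 2 model (or vice versa), the `<<SYS>>` markers end up missing or spurious, which matches the swapped behavior described above.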
-
Hey guys,
This is a great library, but I have a question: is this library able to use memory as efficiently as llama.cpp? In other words, if I'm using a checkpoint that I use with Llama…
-
It looks like PyPI only has the source distribution for each release: https://pypi.org/project/llama-cpp-python/0.2.6/#files
But the GitHub release at https://github.com/abetlen/llama-cpp-pytho…
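One way to check which distribution types a release actually ships is PyPI's JSON API (`https://pypi.org/pypi/<project>/<version>/json`), where each uploaded file carries a `packagetype` field: `sdist` for source distributions, `bdist_wheel` for built wheels. The sample payload below is illustrative only, not the real file list of any release.

```python
# Sketch: distinguishing wheels from sdists in a PyPI JSON API payload.
# The sample data is made up for illustration; a real payload would come
# from https://pypi.org/pypi/<project>/<version>/json.

sample_release = {
    "urls": [
        {"filename": "llama_cpp_python-0.2.6.tar.gz", "packagetype": "sdist"},
    ]
}

def has_wheel(release: dict) -> bool:
    """Return True if any uploaded file in the release is a built wheel."""
    return any(f["packagetype"] == "bdist_wheel" for f in release["urls"])

print(has_wheel(sample_release))  # the sample contains only an sdist
```

When a release has no wheel on PyPI, `pip install` falls back to building the sdist locally, which for this package means compiling llama.cpp from source.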
-
There has been a completed merge of Mamba model support over at llama.cpp; would it be possible to implement this in Ollama as well?
Merged PR: https://github.com/ggerganov/llama.cpp/pull/5328
…
-
### What happened?
I used `cmake -B build` to generate a Visual Studio solution. After that, when compiling `test-grammar-integration.cpp` with MSVC, the error "newline in constant" occurred.
Here …
-
I'm trying to use the low-level API in my own program. Loading the model (I am using Pygmalion-13B.ggmlv3.Q6_K.gguf) works fine and I get no errors. However, when I try to evaluate the model via lla…
-
### Godot version
v4.2.2
### godot-cpp version
latest
### System information
Windows 11
### Issue description
Can you add some more details on how to build this addon?
It seems like it uses zi…
-
I've discovered a performance gap between the Neural Speed MatMul operator and the llama.cpp operator in the Neural-Speed repository. This issue was identified while running a benchmark with the ONNXR…
-
I ran the command like this:
```bash
bun x humanifyjs local responsez.js
ggml_vulkan: Found 1 Vulkan devices:
Vulkan0: NVIDIA GeForce GTX 1070 (NVIDIA) | uma: 0 | fp16: 0 | warp size: 32
[nod…
-
### ⚠️ This issue respects the following points: ⚠️
- [X] This is a **bug**, not a question or a configuration/webserver/proxy issue.
- [X] This issue is **not** already reported on [GitHub](https://…