-
https://github.com/IntrinsicLabsAI/gbnfgen
This would be HUGELY useful. You could take C# classes/records and have it generate JSON grammars for them that guaranteed your responses (if you didn't r…
-
There have been a few reports where the grammar sampling can significantly degrade the performance.
It would be nice to profile and optimize the implementation - there should be room for improvements…
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…
jkfnc updated
2 months ago
-
### Have you searched for similar requests?
None found.
### Is your feature request related to a problem? If so, please describe.
Basically all in the title.
More detailed: If there will b…
-
### What happened?
```markdown
>> cd 'G:\LLAMA-CPP\WINDOWS10ACER\llama.cpp-b3661'
>> llama-
llama-baby-llama.exe llama-gguf-split.exe llama-passkey.exe
llama-batch…
-
### What happened?
I have tried to load the `json.gbnf` grammar example but haven't been able to do so. The following code is not working.
```
from llama_cpp.llama import Llama, LlamaGrammar
imp…
-
Hi. I have checked the new version of Wingman, and I am quite disappointed by it: there is no first class support for local models and backends. Apparently the extension was built around the way that …
synw updated
9 months ago
-
Hey, thank you so much for the great model and this repo!
Would you be willing to add support for this chat format to llama-cpp-python, so that we can use function calling (and JSON mode) with thei…
-
### Have you searched for similar [requests](https://github.com/SillyTavern/SillyTavern/issues?q=)?
Yes
### Is your feature request related to a problem? If so, please describe.
N/A
### De…
-
### What happened?
Hello,
I asked Command-R to generate a json structured output.
First without grammar and then with grammar.
The point of this is to adjust the grammar on the run to no confu…