-
**PowerInfer** is amazing work that achieves great performance! Inspired by your brilliant ideas, I am thinking about developing new features based on `llama.cpp`.
However, it is a bit…
-
### What is the issue?
I successfully converted jina-embeddings v2 base zh to GGUF with llama.cpp and imported it into llama.
Here is my Modelfile
> root@buaa-KVM:~/1T/ollama/Jina-AI-embeddin…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
- Windows is largely untested. Don't know of anyone who has it working natively, only in WSL
- Known issues on Windows: https://github.com/exo-explore/exo/issues/184 , tinygrad not officially support…
-
Hello,
Do you plan to make the various Albert models available in GGUF format on https://huggingface.co/AgentPublic, as is done by [TheBloke](https://huggingface.co/TheBloke?search…
-
Hello, I have problems with the newest versions of llama-cpp-python.
Generation appears to run fine, but with the same GGUF file I receive completely nonsensical responses or weird artifacts.
For exampl…
-
Hi,
I get this error when I try to summarize a document using llama-3 (via llama_cpp and LlamaIndex). I just found that it's a llama_cpp error and not a LlamaIndex error. If I downgrade to ll…
-
When using the latest update with how I've set up my embedded install, I ran into this issue:
```
No module named 'llama_cpp.llama_chat_format'
Traceback (most recent call last):
File "Z:\Anybot…
-
Hi,
I am trying to run it on my Mac, but it reports that it cannot find the shared library "llama".
Here is the error message:
`
(llama) chen@chenshuodeMBP-9 llama-cpp-python % python3 -m llama_cp…
-
To properly manage antiprompts, users must have two options available:
* break after the antiprompt was generated
* backtrack to just before the antiprompt was generated and continue from there
…
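
The two options can be sketched roughly as follows. This is only an illustration over a stream of already-decoded text pieces, not how llama.cpp implements it; `AntipromptMode` and `apply_antiprompts` are hypothetical names. A real engine would presumably also have to rewind its context and sample again when backtracking, whereas here the stream simply keeps going.

```python
from enum import Enum
from typing import Iterable, Sequence


class AntipromptMode(Enum):
    """Hypothetical switch between the two behaviours listed above."""
    BREAK = "break"          # stop right after the antiprompt was generated
    BACKTRACK = "backtrack"  # cut the antiprompt out and keep generating


def apply_antiprompts(
    pieces: Iterable[str],
    antiprompts: Sequence[str],
    mode: AntipromptMode,
) -> str:
    """Accumulate generated text pieces and react to antiprompts.

    `pieces` stands in for the model's decoded output stream (an assumption
    made for this sketch; a real engine works on tokens and its own context).
    """
    stops = [s for s in antiprompts if s]  # ignore empty antiprompts
    text = ""
    for piece in pieces:
        prev_len = len(text)
        text += piece
        # Only look at the region that the new piece could have completed.
        hits = []
        for stop in stops:
            idx = text.find(stop, max(0, prev_len - len(stop) + 1))
            if idx != -1:
                hits.append((idx, stop))
        if not hits:
            continue
        idx, stop = min(hits)  # earliest antiprompt wins
        if mode is AntipromptMode.BREAK:
            # Option 1: keep the antiprompt in the output and stop.
            return text[: idx + len(stop)]
        # Option 2: drop everything from the antiprompt onward and continue.
        text = text[:idx]
    return text


stream = ["Hello", " there", "\nUs", "er:", " more"]
broken = apply_antiprompts(stream, ["\nUser:"], AntipromptMode.BREAK)
backtracked = apply_antiprompts(stream, ["\nUser:"], AntipromptMode.BACKTRACK)
```

Matching on the accumulated text rather than on single pieces matters because an antiprompt can be split across several generated pieces, as `"\nUs"` and `"er:"` are in the example stream.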