-
### What behavior of the library made you think about the improvement?
I need to install torch, transformers, accelerate, etc. even if I want to use outlines only with the llamacpp backend.
Are these d…
-
### What happened?
If creating a llama model in Python code, you can specify `n_gpu_layers=-1` so that all layers are offloaded to the GPU (see the example below). When starting the llama.cpp server using the doc…
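For reference, a minimal sketch of the Python-side usage described above, using llama-cpp-python. The model path is a placeholder (an assumption, not from the issue), and the load is guarded so the sketch degrades gracefully when no GGUF file is present:

```python
# Hedged sketch: offloading all layers to the GPU with llama-cpp-python.
# n_gpu_layers=-1 means "offload every layer"; MODEL_PATH is a placeholder.
import os

MODEL_PATH = "model.gguf"  # hypothetical local GGUF file

if os.path.exists(MODEL_PATH):
    from llama_cpp import Llama  # pip install llama-cpp-python

    # -1 offloads all layers to the GPU (requires a GPU-enabled build)
    llm = Llama(model_path=MODEL_PATH, n_gpu_layers=-1)
    out = llm("Hello", max_tokens=8)
    print(out["choices"][0]["text"])
else:
    print("model.gguf not found; skipping load")
```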
-
**The bug**
When using `models.LlamaCpp`, the selected tokenizer is always gpt2 (this can be seen in the output when the `verbose=True` arg is set). I have pasted the dumped KV metadata keys:
```
llama_mod…
```
-
Hi,
I am unable to import LlamaCpp in IPEX.
CODE: `from ipex_llm.langchain.llms import LlamaCpp`
ERROR:
Cell In[5], line 1
----> 1 …
-
### Start Date
_No response_
### Implementation PR
_No response_
### Reference Issues
_No response_
### Summary
Hi, I am trying to load this using Llama.CPP HTTP s…
-
### Describe the problem you're trying to solve
Proof of Concept (PoC) a generic inference container that uses Triton as the inference engine and can download and utilize a ModelKit as efficiently as …
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
Normally, if one were starting a llama.cpp server, one would specify the chat template a…
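For context, this is roughly how a chat template is passed when launching the server directly. A hedged sketch: the model path and template name are placeholders, and the flag assumes a recent llama.cpp build:

```shell
# llama.cpp's server accepts a named built-in chat template at startup
# (e.g. chatml, llama2, llama3); model path here is a placeholder.
./llama-server -m ./model.gguf --chat-template chatml
```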
-
## Goal
- We should have a more semantic naming format for Cortex engines
- e.g. `llamacpp-engine` instead of `cortex.llamacpp`
## Tasklist
- Discussion: https://github.com/janhq/cortex.cpp/discussio…
-
**Pages**
All of the pages
**Success Criteria**
Updated and simplified
## Tasklist
- [ ] cortex.cpp README
- [x] https://github.com/janhq/cortex.so/issues/82
- [x] https://github.com/janhq/…
-
**The bug**
A string containing certain Unicode characters causes an exception.
Likely because `歪` is a multi-token character for this tokenizer.
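As a quick illustration of why this character is awkward for byte-level tokenizers (plain Python, no llama.cpp involved):

```python
# '歪' (U+6B6A) occupies three bytes in UTF-8, so a byte-level BPE tokenizer
# can split it across multiple tokens rather than mapping it to a single one.
ch = "歪"
encoded = ch.encode("utf-8")
print(len(encoded))  # 3
print(encoded)       # b'\xe6\xad\xaa'
```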
```
llama3.engine.tokenizer('歪'.encode('utf8')…
```