-
### .env
```
# Generic
TEXT_EMBEDDINGS_MODEL=sentence-transformers/all-MiniLM-L6-v2
TEXT_EMBEDDINGS_MODEL_TYPE=HF # LlamaCpp or HF
USE_MLOCK=false
# Ingestion
PERSIST_DIRECTORY=db
DOCUMENTS…
-
I would like to run the `ggml/gpt-j` version on the MLPerf benchmark. Is it possible to use a fine-tuned GPT-J checkpoint listed here: https://github.com/mlcommons/inference/blob/master/language/gpt-j…
-
After installing the code and models successfully I ran per the directions in the README. However the output is useless from a code development perspective and does not at all match the README files …
-
### Description
I get an error Method not found: 'Double Microsoft.KernelMemory.AI'
### Reproduction Steps
repeating the example, except replacing the document with text
https://github.com/SciSha…
-
Description:
The commit 93935980f8bcc3d230d313174ff59635c3c80d1b introduced Metal and ggml-alloc support in the whisper.cpp repository. However, this commit caused the initialization of the whisper c…
-
Hi @muelletm. I am getting error on Ubuntu 22.04.1 LTS.
```
from alpaca import Alpaca, InferenceRequest
# ./chat is the executable I use to run alpaca directly.
alpaca_cli_path = './chat'
mode…
-
Doesn't happen with the other Vulkan backend.
-
Is there anyway to run it in 4G or less vram?
ggml? or gptq?
-
Issue:
```
curl http://localhost:11434/api/embeddings -d '{
"model": "mxbai-embed-large",
"prompt": "Llamas are members of the camelid family"
}'
{"error":"llama runner process has terminate…
-
### What is the issue?
after version 0.1.32 of Ollama,there always have a bug with the api of embedding
the embedding model I used is "smartcreation/bge-large-zh-v1.5",and dztech/bge-large-zh:v1.5 …