-
When chatting with an LLM, sometimes dir-assistant sends too much context to the llm.
Counts appear correct on dir-assistant's end. Perhaps this is because in some cases, the embedding models's tok…
-
Given that we have only Llama 3 70B and 8B, it would be useful to have a Tiny Llama based on the Llama 3 tokenizer so that we can use it as a drafting model for speculative decoding.
Are there pla…
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…
-
Are there instructions specific to creating a bmodel from onnx for Llama 3.1 (not lllam3)
Running this is erroring out.
python export_onnx.py --model_path ../../../../Meta-Llama-3.1-8B-Instruct/ -…
-
### Bug Description
Occasionally the document `docstore/data` is not being created (KVDocumentStore) when `add_documents` is called
However `docstore/metadata` is consistently being created.
Th…
-
Documented here: https://github.com/abetlen/llama-cpp-python?tab=readme-ov-file#embeddings
Example:
```python
import llama_cpp
model = llama_cpp.Llama(model_path="all-MiniLM-L6-v2.e4ce9877.q8_…
-
### Bug Description
I am trying to replicate the RAG approach used [here](https://netraneupane.medium.com/retrieval-augmented-generation-rag-using-llamaindex-and-mistral-7b-228f93ba670f).
The soft…
-
I would love to use Claude more, but either it is crazy expensive, or I'm limited by rate and or tokens used.
On a larger project were there are tons of files that need to be evaluated created et…
-
### Initial Checks
- [X] I have searched GitHub for a duplicate issue and I'm sure this is something new
- [X] I have read and followed [the docs & demos](https://github.com/modelscope/modelscope-age…
-
**Describe the bug**
Using v0.2.146, installation works fine, but when i finished to create the integration, i got a "failed to configure" message.
**Expected behavior**
Integration shall s…
pbn42 updated
1 month ago