-
I have noticed on some online examples that some users struggle to understand how our Secret Management works.
[#1](https://www.mindfiretechnology.com/blog/archive/installing-haystack-for-pgvector-in…
-
**The bug**
I have a minimal reproducible example where I would expect `select` and `gen` to produce similar results, but they don't. My experimentation suggests maybe a tokenization or token healing…
-
### Describe the issue as clearly as possible:
The example with custom fsm from documentation doesnt work for LlamaCpp as
```
logits, kv_cache = model(token_ids, attention_masks, kv_cache)
…
-
Hi,
I tried to use `exllamv2` with Mistral 7B Instruct instead of my `llama-cpp-python` test implementation.
`exllamv2` works, but the performance is very slow compared to `llama-cpp-python.`
…
-
Thank you guys for doing the work to put all of this together.
I'm having trouble with the openapi-generator for this project and thought it would be nice if you guys could put a sample of how to…
-
### Bug Description
While using `SQLAutoVectorQueryEngine`, if I set `verbose=True` and provide it a `vector_query_tool` with `streaming = True`, the `StreamingResponse` returned doesn't stream.
…
-
[ /] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
I'm currently trying to use HuggingFace LLMs and embedding m…
-
**Is your feature request related to a problem? Please describe.**
I'm trying to evaluate a local LLM model using Exllamav2 and Deepbench's support for the [MMLU dataset](https://docs.confident-ai.co…
-
https://github.com/llm-jp/llm-jp-eval/pull/115 の `offline_inference_example.py` を参考にvLLMでオフライン推論処理を実装する。
-
### Summary
Nocturnal is a tool that will allow artists to easily create NFTs on TON blockchain and put them on sale.
With easy to use interactive graphic interface, Nocturnal will allow set u…