-
## Current Code
Used the [pipecat example code here](https://github.com/pipecat-ai/pipecat/blob/main/examples/foundational/15-switch-voices.py) to define the context and pass it to OpenAILLMContext.
…
-
- [ ] Using pdfs used for QA creation, creating KG
- [ ] Testing on those pdfs
- [ ] Prompt tuning for entities linked below
- [ ] Testing using LLM eval.
[Microsoft graph rag repo link ](http…
-
Conduct comprehensive testing of the outputs generated by the pipeline and Gaia node. The testing process will include validating the accuracy and completeness of the data embeddings, ensuring that th…
-
This is when you usually will lose any type of verbose response in inferencing
`torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 15.33 GiB. GPU 0 has a total capacity of 24.00 GiB of w…
-
**What would you like to be added/modified**:
A benchmark suite for large language models deployed at the edge using KubeEdge-Ianvs:
1. Interface Design and Usage Guidelines Document;
2. Implem…
-
Affected versions: 78f5c2936b7bdaa56859075a3f2fcc5a63952134, 5fa9436e17c2f9aeace070f49aa645d2577f676b
Affected APIs: GptManager, Executor
Hardware: H100
```
python examples/llama/convert_checkpoint.p…
-
### What happened?
Hi, when stress testing llama-server (--parallel 3, prompt="Count 1 to 10000 in words") and running deepseek-coder-v2:16b-lite-instruct-q8_0 i got this assertion error in the logs…
-
I saw the link regarding the new GSZ that will be presented at SC24. Looks very useful and thus wanted to see if the code will be posted here for use and testing? I'm working on compressing llm activa…
-
We are building a voice-interactive chatbot that leverages cutting-edge technologies such as Speech-to-Text (STT), Text-to-Speech (TTS), and local Large Language Models (LLMs), with a focus on Ollama'…
-
Thank you for your insightful work and contribution. I just have a few questions regarding the work in the paper and the code implementation:
1. I understand that we have used the training data or the…