-
### Area(s)
area:gen-ai
### Is your change request related to a problem? Please describe.
The current conventions for GenAI spans require the span kind to be `CLIENT` for GenAI applications interac…
-
I'm new to this specific project, and I don't say any of the following with high confidence.
Things that I see as important for quantization:
*Inference speed*
- AWQ seems best on this front, t…
-
## Type
- [x] General question or discussion
- [x] Propose a brand new feature
- [ ] Request modification of existing behavior or design
## What is the problem that your feature reques…
-
topic: guided generation
**Guided Generation** - Deliver structured outputs from the unstructured data using LLM
https://jxnl.github.io/instructor/why/
https://jxnl.github.io/instructor/concepts/p…
-
after all requirements have been satisfied, but hit memory malloc fails, as following output, while trying to run simple_ui.py
is there a minimum hbm memory requirement for the device? and i tried it…
-
Description:
I'm encountering an error while running a Flask application that integrates Gemini models with NemoGuardRails. The error seems to be related to asynchronous task handling within LLMRails…
-
Do you have some benchmarks against llama.cpp?
-
The money spent purchasing market-based instruments (offsets) means investment goes **away** from green software and energy efficiency initiatives and **towards** low-carbon energy suppliers and carbo…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
## 🌟 Feature Description
Support for open-source llms (e.g., Ollama) in RD-Agent, allowing the integration and utilization of these models within the RD-Agent framework.
## Motivation
Adding Ol…