-
### Issue you'd like to raise.
I'm new to LangSmith and find the dataset structure more complicated (and confusing) than it needs to be.
In some ways, the dataset is treated like a table, with Inp…
-
-
**Bug Description**
When I query the "anthropic.claude-3-5-sonnet-20240620-v1:0" model on Bedrock (it also happens with "anthropic.claude-3-haiku-20240307-v1:0", but there it's a smaller issue), a `b…
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch…
```
-
[X] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
I have a locally hosted LLM which I intend to use as a jud…
-
We need to start dividing up the work, authoring the various Threats / Controls. We'll use this issue to manage that work and their assignments.
Each threat / control is 'ticked' when assigned to …
-
Hi,
I'm currently trying to replicate the performance of Qwen2-Audio on the AIR Bench. However, I noticed that the repository at [AIR-Bench](https://github.com/OFA-Sys/AIR-Bench/blob/main/score_cha…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
I designed a chatbot with an Agent to perform a series of actions.
My agent works like…
-
### Willingness to contribute
No. I can't contribute this feature at this time.
### Proposal summary
There are currently LLM-as-a-judge metrics like Hallucination detection and Moderation score. Ho…
-
Opening this issue for the translation to Turkish, for which I called for contributions [here](https://x.com/mervenoyann/status/1848267466314563825). 🇹🇷❣️
If you feel like getting started, you can …