-
**What problem or use case are you trying to solve?**
This is mostly for evaluation purpose. I am not sure what this mode should be properly named, so let me describe the scenario here:
1) Somet…
-
### Proposal summary
See in the UI the traces that are produced when using LLM as Judges, both General and LLM spans.
### Motivation
When using LLM as Judged metrics in your Evaluations, it is usef…
-
Hello, I use local data, including reference and response data, use Azure, and then use ragas to obtain accuracy indicators, but errors are reported:
`
Evaluating: 0%| | 1/301 [00:04
-
https://eugeneyan.com/writing/llm-evaluators/
-
Evaluate using https://github.com/xenova/transformers.js as local executor or fallback for when webgpu is not available
-
As we begin to evaluate LLM assisted root cause analysis, we need a way to be able to evaluate the validity and usefulness of the results.
Historically, our process for evaluating these results has …
-
### Proposal summary
## Feature Request
Enable Opik to display additional media formats, including audio, PDF, and video.
## Background
Opik currently supports only image display, which li…
-
**Describe the bug**
while running matrices **Knowledge retention**, getting error. I ensure that this is not all of the LLMTestcases. I am getting correct knowledge retention score for many inputs. …
-
Hello,
Thank you very much for open-sourcing this project! I am currently researching tools that utilize LLMs for XPath extraction and came across your project, llm-xpath. It looks very interesting…
-
### Feature request / 功能建议
how to use a local LLM to evaluate prediction quality? For example, Llama-3-70B-Instruct?
### Motivation / 动机
how to use a local LLM to evaluate prediction quality? For …