-
[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
**Your Question**
> “WARNING:ragas.llms.output_parser:Failed to parse …
-
# Alex Strick van Linschoten - How to think about creating a dataset for LLM finetuning evaluation
I summarise the kinds of evaluations that are needed for a structured data generation task.
[https:…
-
# Task Name
Interactive Data Analysis
## Task Objective
Interactive Data Analysis, a collaboration between humans and Large Language Model (LLM) agents, enables real-time data exploration for…
-
1. mismatched machine unlearning
Title: Decoupling the Class Label and the Target Concept in Machine Unlearning
arXiv: https://arxiv.org/abs/2406.08288
2. evaluation of LLM unlearning
Title: Unl…
-
I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
I am trying to run the template code from the GitHub README page.…
-
Thank you for your great work on LLM agents. I would like to know when you will release all of the code (including the implementation and evaluation code).
Thank you.
-
So I'm trying to evaluate an LLM response ad hoc.
I have multiple asserts like:
A: Check that the enum is in the results for "Input A" in the prompt
B: Check that the result is SQL for "Input B"
C: Check there is LI…
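
One way to express per-input asserts like A and B above is a plain mapping from input label to check functions. This is only a minimal sketch under assumptions: the enum values, the `CHECKS` table, the helper names, and the keyword-based SQL heuristic are all hypothetical and not taken from any library, and check C is omitted because its description is cut off.

```python
import re
from typing import Callable, Dict, Iterable, List

# A check takes the raw LLM response text and returns True on success.
Check = Callable[[str], bool]


def contains_enum(allowed: Iterable[str]) -> Check:
    """Build a check that passes if any allowed enum value appears in the response."""
    values = list(allowed)

    def check(response: str) -> bool:
        return any(value in response for value in values)

    return check


def looks_like_sql(response: str) -> bool:
    """Rough heuristic (an assumption): the response starts with a common SQL keyword."""
    pattern = r"\s*(SELECT|INSERT|UPDATE|DELETE|WITH)\b"
    return re.match(pattern, response, re.IGNORECASE) is not None


# Hypothetical mapping from input label to the checks that apply to it.
CHECKS: Dict[str, List[Check]] = {
    "Input A": [contains_enum({"RED", "GREEN"})],  # hypothetical enum values
    "Input B": [looks_like_sql],
}


def run_checks(input_label: str, response: str) -> List[bool]:
    """Run every check registered for the given input; unknown inputs get no checks."""
    return [check(response) for check in CHECKS.get(input_label, [])]
```

Calling `run_checks("Input B", response)` returns one boolean per registered check, so a caller can assert on `all(...)` or report individual failures.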
-
When I run `evaluate` with any Vertex AI model, I get several warnings that say
> Gapic client context issue detected.This can occur due to parallelization.
And sometimes the execution of eva…
-
# Alex Strick van Linschoten - My finetuned models beat OpenAI’s GPT-4
Finetunes of Mistral, Llama3 and Solar LLMs are more accurate for my test data than OpenAI’s models.
[https://mlops.systems/pos…