vecorro closed this issue 8 months ago
Hey @vecorro, there is a mistake in your code:
from ragas.testset.evolutions import simple, reasoning, multi_context
distributions={simple: 0.5, reasoning: 0.25, multi_context: 0.25}
Evolutions are objects, not strings. Which documentation did you follow? It could be an older one that we forgot to update.
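To make the object-versus-string point concrete, here is a minimal sketch of a guard you could run before calling the generator. `check_distributions` is a hypothetical helper, not part of ragas, and the three stand-in objects only mimic the names imported from `ragas.testset.evolutions` above so the snippet runs without ragas installed. The check that weights sum to 1 is also an assumption about what a sensible distribution looks like.

```python
import math


def check_distributions(distributions):
    """Hypothetical pre-flight check (not a ragas API).

    Rejects string keys, which is the mistake discussed in this thread,
    and, as an assumption, requires the weights to sum to 1.
    """
    for key in distributions:
        if isinstance(key, str):
            raise TypeError(
                f"distribution key {key!r} is a str; pass the evolution object instead"
            )
    total = sum(distributions.values())
    if not math.isclose(total, 1.0, abs_tol=1e-6):
        raise ValueError(f"distribution weights sum to {total}, expected 1.0")
    return distributions


# Stand-ins for illustration only. In real code these would be the objects
# imported with: from ragas.testset.evolutions import simple, reasoning, multi_context
simple, reasoning, multi_context = object(), object(), object()

# Passes: keys are objects and the weights sum to 1.
distributions = check_distributions(
    {simple: 0.5, reasoning: 0.25, multi_context: 0.25}
)
```

Calling the helper with `{"simple": 0.5, "reasoning": 0.25, "multi_context": 0.25}` raises a `TypeError` up front, which is easier to read than an attribute error surfacing deep inside the generation run.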
Also, a recommendation: it would be better to use Mixtral 8x7B than models from WizardLM (derivatives of Llama 2), @vecorro.
Many thanks @shahules786, you are right. I took the example from another user's comment on a different issue. I re-checked the current online documentation, Generate a Synthetic Test Set, and it matches the code block you shared with me. The error went away, although the generation process has now been running for over 2 hours. I requested 'test_size=10' over a 100-document corpus. I still don't understand the exact meaning of the 'test_size' parameter, and the documentation does not explain it. Anyway, I'm closing this issue, as the error I reported was due to a misconfiguration on my side.
BTW, you were right. The Wizard LLM was causing errors in the generation process. I switched to Zephyr, which is derived from Mistral 7B, and it seems to be working, or at least there are no execution errors during generation :) Many, many thanks.
[X] I have checked the documentation and related resources and couldn't resolve my bug.
Describe the bug Hi,
I tried to generate a test dataset with TestsetGenerator from both LangChain and LlamaIndex documents, and I'm hitting the same issue in both cases. It looks like it is due to a bad attribute transfer across classes, where the generator LLM object gets replaced by a str. Please find the details below.
Thanks very much!
Ragas version: 0.1.4
Python version: 3.10
Code to Reproduce
Error trace
Expected behavior I expected a successful generation of the dataset
Additional context The same error occurs when I try to generate from LangChain or LlamaIndex docs.