:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
When running two pipelines and comparing their results to each other, I would like to see the predicted answers of each pipeline run in the resulting pandas dataframe.
When running two pipelines and comparing their results to each other, I would like to see the predicted answers of each pipeline run in the resulting pandas dataframe.
Here is an example of how this is done: https://github.com/deepset-ai/haystack-evaluation/blob/1f2747ec59101231c9857fd2b07948c87ed9d181/evaluations/evaluation_sentence_window_retrieval.py#L123
When I run this script, the resulting csv has these columns:
Instead of
predicted_answers
, it should have:base-rag_predicted_answers
andwindow-retrieval_predicted_answers
.