Closed: davidsbatista closed this 2 months ago
This was fixed by https://github.com/deepset-ai/haystack/pull/7841
I had a more in-depth call with Thomas. He's aware of these issues, and they had to resort to "tricks" and hacks to be able to evaluate it. I told him to reach out to me when dC starts using the relevancy-based evaluators from Haystack 2.x.
The MRR evaluator always returns 0.0 when we index/retrieve chunks of the original document.
Related to this: https://github.com/deepset-ai/haystack/blob/main/haystack/components/evaluators/document_mrr.py#L77
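We don't index/retrieve the original document with its full content, but chunks of the original document (the result of applying the DocumentSplitter). A chunk is shorter than the ground-truth document, so the full ground-truth content can never be a substring of it, and this check will always fail: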
if ground_document.content in retrieved_document.content
One possible solution is to switch the test:
if retrieved_document.content in ground_document.content
This is still an expensive operation, though. If the meta field of a document contains a unique id, we could compare the ids instead.

Discuss with Thomas Stadelmann about "context matching" in dC.
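To make the difference between the three checks concrete, here is a minimal sketch. It does not use the actual Haystack evaluator: `Doc` is a hypothetical stand-in for a document with `content` and `meta`, `reciprocal_rank` is a simplified single-query version of the metric, and the `source_id` meta key is an assumption about how chunks could point back to their original document.

```python
from dataclasses import dataclass, field


@dataclass
class Doc:
    """Hypothetical stand-in for a document: only content and meta."""
    content: str
    meta: dict = field(default_factory=dict)


def reciprocal_rank(ground_truth, retrieved, is_match):
    """Return 1/rank of the first retrieved doc that matches any ground-truth doc, else 0.0."""
    for rank, ret in enumerate(retrieved, start=1):
        if any(is_match(gt, ret) for gt in ground_truth):
            return 1.0 / rank
    return 0.0


def original_test(gt, ret):
    # Check from the issue: full ground-truth content inside the retrieved chunk.
    return gt.content in ret.content


def reversed_test(gt, ret):
    # Proposed switch: retrieved chunk inside the full ground-truth content.
    return ret.content in gt.content


def id_test(gt, ret):
    # Assumption: both documents carry a shared "source_id" in their meta.
    key = gt.meta.get("source_id")
    return key is not None and key == ret.meta.get("source_id")


# One ground-truth document and two retrieved chunks; the relevant chunk is ranked second.
full = Doc(
    "Berlin is the capital of Germany. Paris is the capital of France.",
    {"source_id": "doc-1"},
)
chunks = [
    Doc("Rome is the capital of Italy.", {"source_id": "doc-2"}),
    Doc("Paris is the capital of France.", {"source_id": "doc-1"}),
]

rr_original = reciprocal_rank([full], chunks, original_test)
rr_reversed = reciprocal_rank([full], chunks, reversed_test)
rr_by_id = reciprocal_rank([full], chunks, id_test)
```

With chunked retrieval, the original check finds no match, while the reversed check and the id comparison both find the relevant chunk at rank 2. The id comparison avoids substring scans over the full document text, but only works if such an id is reliably present in the meta field.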