Closed relic-yuexi closed 2 months ago
i have know why will make difference between pytrec_eval and ndcg_score. But i still don't know why changed key name will lead different score.
An example that isolates the difference more clearly would help us understand what's happening here. I suspect it comes down to how each handles tie-breaking in the run scores, given the document ID that's changing is tied with another one and would be sorted at a different position with the new ID.
Hello author, I have encountered some strange results. The reproduce code is below.