Closed alvarobartt closed 6 months ago
Hi, sorry about the confusion. We run 2 games of judgment for each answer against the baseline. We switch position of the answers present to the judge to mitigate positional bias. That is why on the second game, A>>B or A>B means model_b wins. The code is already implemented correctly.
Hi, sorry about the confusion. We run 2 games of judgment for each answer against the baseline. We switch position of the answers present to the judge to mitigate positional bias. That is why on the second game, A>>B or A>B means model_b wins. The code is already implemented correctly.
Oh sorry for the oversight then!
Description
This PR fixes a bug within the
winner
field for the game 2, since when eitherA>B
orA>>B
the winner is set tomodel_b
while it should bemodel_a
; also applies to the other way around.cc @CodingWithTim