Open bjj opened 9 months ago
Thanks for the feedback! The concern is indeed valid. I will need to think about how ties should be handled, though.
I did not implement the "both wrong" and "both ok" options for two reasons:
I need to think about how to handle ties in elimination matches, or replace elimination with something else.
After running through some test prompts there are many instances where there is nothing to separate the two answers (they're both wrong exactly the same amount or in the same way). There's probably something more statistically valid than picking at random in those cases.