lenglund closed 8 years ago
Michael will run some simulations to see what the best number of evaluations is for different group sizes.
Conclusions:
We'd need to collaborate with researchers to implement weighting, should this issue go forward.
A decision has been made to update the algorithm to include weighting, possibly using the Elo rating system (https://en.wikipedia.org/wiki/Elo_rating_system). Ido has the details for this.
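For reference, a minimal sketch of the standard Elo update as it might apply to pairwise answer comparisons. This is not the project's implementation: the function names, the starting rating of 1400, and the K-factor of 32 are assumptions chosen for illustration, and the formula is the textbook one from the Wikipedia article linked above.

```python
# Illustrative sketch only: names, K-factor and starting rating are assumptions.

def expected_score(rating_a, rating_b):
    """Probability that A is preferred over B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a, rating_b, score_a, k=32.0):
    """Return the new (rating_a, rating_b) after one comparison.

    score_a is 1.0 if A was judged better, 0.0 if B was, 0.5 for a tie.
    """
    delta = k * (score_a - expected_score(rating_a, rating_b))
    # Whatever A gains, B loses, so the total rating is conserved.
    return rating_a + delta, rating_b - delta


# Hypothetical usage: three answers, each comparison is (winner, loser).
ratings = {"answer_1": 1400.0, "answer_2": 1400.0, "answer_3": 1400.0}
comparisons = [
    ("answer_1", "answer_2"),
    ("answer_1", "answer_3"),
    ("answer_3", "answer_2"),
]
for winner, loser in comparisons:
    ratings[winner], ratings[loser] = update_elo(
        ratings[winner], ratings[loser], 1.0
    )
```

Because each update is weighted by how surprising the result is, beating a highly rated answer moves the rating more than beating a low-rated one, which is the kind of weighting discussed above.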
Some observations on the algorithms after running some simulations:
Corrected Comparative Judgement Algorithm
Elo Algorithm
TrueSkill
We need to decide on some criteria for selecting which algorithm we want to go with.
Ido mentioned that:
Elo might be the better candidate given these criteria.
Ido would like to discuss the ranking algorithm in more depth to determine whether it can or should be fine-tuned further for future use. Right now, many of the answer pairs receive the same score despite being compared 8-12 times.
Possibly add ability to select algorithm? (issue #312)