JULIELab / trec-pm

Support code and resources for participation at the TREC Precision Medicine Track (TREC-PM)
http://trec-cds.appspot.com
MIT License

Add support for paired statistical tests #112

Closed by michelole 4 years ago

michelole commented 4 years ago

The current Python code for statistical testing uses the randtest library for approximate randomization. That test assumes the groups being compared are independent.

However, our experimental units (topics) are paired: every experiment is run on the same set of topics, so the per-topic scores of different runs are dependent.

This commit reimplements statistical testing using the popular R coin library on top of the existing code for parsing trec_eval and sampleval files.
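
To illustrate what "paired" means for the test itself, here is a minimal Python sketch of a paired approximate randomization (sign-flip) test. It is not the R coin-based implementation referred to above; the function name `paired_randomization_test`, its parameters, and the example scores are hypothetical and only serve the illustration. The idea is that, under the null hypothesis, the two runs are exchangeable within each topic, so the sign of each per-topic difference can be flipped at random.

```python
import random


def paired_randomization_test(scores_a, scores_b, n_resamples=10000, seed=0):
    """Two-sided paired approximate randomization test (illustrative sketch).

    scores_a and scores_b hold per-topic scores of two runs, aligned so that
    index i refers to the same topic in both lists.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("paired test needs one score per topic for both runs")

    # Pairing: work on per-topic differences instead of two independent groups.
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    observed = abs(sum(diffs) / n)

    rng = random.Random(seed)
    hits = 0
    for _ in range(n_resamples):
        # Randomly swap the two runs within each topic (flip the sign).
        total = sum(d if rng.random() < 0.5 else -d for d in diffs)
        # Small tolerance guards against floating-point ties.
        if abs(total / n) >= observed - 1e-12:
            hits += 1

    # Add-one correction keeps the estimated p-value strictly above zero.
    return (hits + 1) / (n_resamples + 1)


if __name__ == "__main__":
    # Made-up per-topic scores for two hypothetical runs (not real results).
    run_a = [0.52, 0.61, 0.47, 0.70, 0.58, 0.66, 0.49, 0.73, 0.55, 0.64]
    run_b = [0.45, 0.55, 0.40, 0.66, 0.50, 0.61, 0.44, 0.65, 0.49, 0.60]
    print(paired_randomization_test(run_a, run_b))
```

Because only the sign of each within-topic difference is permuted, topic difficulty cancels out, which is what an independent-groups test would ignore.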