jssmith1 closed 7 years ago
R3 requests clarification about the statistical tests we applied. We tested whether participants performed differently, in terms of task completion time and correctness, when using the full suite of Eclipse tools compared with Flower. We first tested the distribution of task completion times for normality using the Shapiro-Wilk test and failed to reject the null hypothesis of normality (p=.21 and p=.43 for the two tasks, respectively). Therefore, we tested for differences in task completion time by performing two-tailed, unpaired, two-sample t-tests. We tested for differences in task correctness using a chi-square test, which is appropriate for nominal data.
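For reference while drafting, here is a minimal standard-library sketch of the two comparisons described above. It is not the analysis script used for the paper. The function names (`unpaired_t_test`, `chi_square_2x2`) and the `equal_var` switch are assumptions for illustration. The switch is included because R3 asks about sample variance: the pooled (Student) form assumes equal variances, while the Welch form does not. The Shapiro-Wilk step is omitted because the standard library has no implementation of it, and the chi-square sketch assumes a 2x2 table with no continuity correction.

```python
import math
from statistics import mean, variance


def _t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def _two_tailed_p(t, df, steps=2000):
    """Two-tailed p-value, integrating the t density on [0, |t|] with Simpson's rule.

    Intended for small samples; math.gamma overflows for very large df.
    """
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps  # steps must be even for Simpson's rule
    total = _t_pdf(0.0, df) + _t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * _t_pdf(i * h, df)
    area = total * h / 3
    return max(0.0, 1 - 2 * area)


def unpaired_t_test(a, b, equal_var=True):
    """Two-tailed, unpaired, two-sample t-test. Returns (t, df, p).

    equal_var=True pools the variances (Student); False uses Welch's correction.
    """
    na, nb = len(a), len(b)
    va, vb = variance(a), variance(b)  # sample variances (n - 1 denominator)
    diff = mean(a) - mean(b)
    if equal_var:
        df = na + nb - 2
        pooled = ((na - 1) * va + (nb - 1) * vb) / df
        se = math.sqrt(pooled * (1 / na + 1 / nb))
    else:
        sa, sb = va / na, vb / nb
        se = math.sqrt(sa + sb)
        # Welch-Satterthwaite approximation of the degrees of freedom
        df = (sa + sb) ** 2 / (sa ** 2 / (na - 1) + sb ** 2 / (nb - 1))
    t = diff / se
    return t, df, _two_tailed_p(t, df)


def chi_square_2x2(table):
    """Pearson chi-square test on a 2x2 contingency table. Returns (chi2, p).

    No Yates continuity correction is applied; df is 1.
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    # For 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2))
    return chi2, math.erfc(math.sqrt(chi2 / 2))
```

Comparing the `equal_var=True` and `equal_var=False` results on the same completion times would show whether the equal-variance assumption changes the conclusion. With small groups, some expected cell counts in the correctness table may be low, in which case an exact test may be worth considering.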
Not sure how much of this to include given space limitations...
SGTM
Regarding the evaluation, can you give more details on any hypotheses tested and the appropriateness of the tests applied – e.g. relating to the variance of the samples?