R1.12. The author claim (page 6, Discussion) to have test their
model, but what they did only a calibration against a number
of ground-truth metrics. How would you validate this model?
In other words, unless the model was really bad,
I would expect anyway that an optimization process would
give you parameters that reproduce well the ground truth,
but does the model capture really quality and expertise, or
just the ground truth you gave it to approximate? The
discussion section could be a good place where to talk about
this.
R1.12. The author claim (page 6, Discussion) to have test their model, but what they did only a calibration against a number of ground-truth metrics. How would you validate this model? In other words, unless the model was really bad, I would expect anyway that an optimization process would give you parameters that reproduce well the ground truth, but does the model capture really quality and expertise, or just the ground truth you gave it to approximate? The discussion section could be a good place where to talk about this.