Random error: for each labeling task, human has an ER = 0.00 / 0.02 / 0.05 / 0.10 chance of labeling incorrectly
Error Correction:
none:
Run with FAST2 (BM25+SEMI)
three:
Each doc will be labeled at least 2 times, at most 3 times.
machine
machine
Run error check every CR=50 docs reviewed:
Sort docs(code=='yes') by its prediction_probability on current classifier, pick bottom 10 for recheck
Sort docs(code=='no') by its prediction_probability on current classifier, pick top 10 for recheck
Above two steps are to find papers whose labeling human and machine disagree with
recheck ask the reviewer to label the selected docs again, with same error rate ER
If a doc has been labeled same as before, or has been labeled for 3 times, it will be frozen (will not be reckecked in the future).
machine2
Same to machine but:
for each paper previous coded as 'relevant' and currently selected as suspicious, ask reviewer to review it until it is frozen. (to decrease false negative)
machine3
Same to machine but:
for each paper coded as 'relevant', immediately ask reviewers to review it again until frozen. (so that no suspicious paper will be picked from 'relevant' side and false negative is expected to be decreased)
Error
Random error: for each labeling task, human has an ER = 0.00 / 0.02 / 0.05 / 0.10 chance of labeling incorrectly
Error Correction:
none:
Run with FAST2 (BM25+SEMI)
three:
Each doc will be labeled at least 2 times, at most 3 times.
machine
machine
Run error check every CR=50 docs reviewed:
Sort docs(code=='yes') by its prediction_probability on current classifier, pick bottom 10 for recheck
Sort docs(code=='no') by its prediction_probability on current classifier, pick top 10 for recheck
Above two steps are to find papers whose labeling human and machine disagree with
recheck ask the reviewer to label the selected docs again, with same error rate ER
If a doc has been labeled same as before, or has been labeled for 3 times, it will be frozen (will not be reckecked in the future).
machine2
Same to machine but:
machine3
Same to machine but:
Results
ER = 10%
ER = 5%
ER = 2%
ER = 0%
metrics
ER = 10%
ER = 5%
ER = 2%
ER = 0%