Closed mhoangvslev closed 2 months ago
decision_threshold = 1
because we want a markup to be considered valid if and only if all information is correct.decision_threshold = 0.5
, we tolerate minor inaccuracies like in SelfCheckGPT, i.e, if the majority of information is correct, the markup remains correct. decision_threshold = 0.5
and results of Compliance Checker are based on decision_threshold = 1
. The final results you see on this repo is based on decision_threshold = 0.5
.
Context
SemanticValidator
andFactualValidator
.SemanticValidator
andFactualValidator
returns a score [0, 1], which is the ratio of positives over the number of key-value pairs.label = int(score > decision_threshold)
decision_threshold < 0.5
leads to higher recall, whiledecision_threshold > 0.5
leads to higher precision.decision_threshold
?