nishthajain1611 opened 1 year ago
Hi @nishthajain1611
Thank you for your interest in our work.
Your understanding is correct. We use the micro-averaged F1 score, as in Yadav et al., so the metric itself is the same. The difference lies in how the ground truth and predictions are represented: we do not post-process to remove duplicate (tag, phrase) pairs, which retains the realistic setting of the task. Note also that the two papers use different datasets, so the scores are not directly comparable as is.
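To make the difference concrete, here is a minimal sketch of the two conventions (this is not the actual slue-toolkit code; the function and the toy data below are just for illustration). Both compute micro-averaged F1 over (tag, phrase) pairs; the only difference is whether duplicates are collapsed before matching.

```python
from collections import Counter

def micro_f1(gt_pairs, pred_pairs, dedup=False):
    """Micro-averaged F1 over (tag, phrase) pairs.

    dedup=False keeps duplicates (multiset matching, our setting);
    dedup=True collapses duplicates first (as described by Yadav et al.).
    """
    if dedup:
        gt_pairs, pred_pairs = set(gt_pairs), set(pred_pairs)
    gt, pred = Counter(gt_pairs), Counter(pred_pairs)
    # True positives: per-pair overlap of counts between prediction and ground truth.
    tp = sum((gt & pred).values())
    n_pred, n_gt = sum(pred.values()), sum(gt.values())
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_gt if n_gt else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# A phrase occurring twice in the reference counts twice without dedup.
gt = [("ORG", "asapp"), ("ORG", "asapp"), ("PER", "yadav")]
pred = [("ORG", "asapp"), ("PER", "yadav")]
print(micro_f1(gt, pred))              # 0.8 -- duplicates retained
print(micro_f1(gt, pred, dedup=True))  # 1.0 -- duplicates removed
```

As the toy example shows, a system that misses a repeated entity mention is penalized in the duplicate-retaining setting but not in the deduplicated one.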
@sshon-asapp, @felixgwu, @apasad-asapp The paper *End-to-end named entity recognition from English speech* by Yadav et al. specifies that duplicate (tag, phrase) pairs are not considered when computing precision and recall.
Your paper *On the Use of External Data for Spoken Named Entity Recognition* states that it uses the F1 measures from Yadav et al., but the evaluation code in slue-toolkit that you use to score the results does not remove duplicates, and so effectively compares (tag, phrase, identifier) triplets for the F1 score.
Could you please clarify which metric was used for the results published in your paper?