Hi, thank you for your interest, and apologies for the late reply.
Regarding your question, "For example, there is a dialog that contains both cross- and uncross-utterance quadruples. Should the uncross ones be included in the preds or golds?":
The answer is that when computing the cross-utterance metrics, we only consider the cross-utterance quadruples in both the preds and golds sets.
FYI, we have supplemented the evaluation code to handle cross-utterance quadruples.
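For readers who want to check their own numbers, the filtering rule described above can be sketched roughly as follows. This is only a minimal illustration, not the repository's actual evaluation code: the `Quad` structure, its `utterance_ids` field, and the helper names are hypothetical, and it assumes a quadruple counts as cross-utterance when its elements do not all come from the same utterance.

```python
from typing import Iterable, NamedTuple, Tuple


class Quad(NamedTuple):
    """Hypothetical quadruple record; the field names are assumptions."""
    dialog_id: str
    target: str
    aspect: str
    opinion: str
    sentiment: str
    # Utterance index of the target, aspect, and opinion, in that order.
    utterance_ids: Tuple[int, int, int]


def is_cross_utterance(quad: Quad) -> bool:
    # Assumption: a quad is cross-utterance if its elements span
    # more than one utterance.
    return len(set(quad.utterance_ids)) > 1


def cross_utterance_micro_f1(
    preds: Iterable[Quad], golds: Iterable[Quad]
) -> Tuple[float, float, float]:
    # Keep only cross-utterance quads on BOTH sides, as described in the reply.
    pred_set = {q for q in preds if is_cross_utterance(q)}
    gold_set = {q for q in golds if is_cross_utterance(q)}

    # Micro-averaging: counts are pooled over all dialogs before computing
    # precision and recall (dialog_id is part of each quad, so identical
    # quads from different dialogs stay distinct).
    true_positives = len(pred_set & gold_set)
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy example: one dialog with a cross-utterance and an uncross-utterance quad.
cross_gold = Quad("d1", "phone", "battery", "lasts long", "pos", (0, 0, 2))
uncross_gold = Quad("d1", "phone", "screen", "too dim", "neg", (1, 1, 1))
wrong_cross_pred = Quad("d1", "phone", "camera", "blurry", "neg", (0, 3, 3))

golds = [cross_gold, uncross_gold]
preds = [cross_gold, uncross_gold, wrong_cross_pred]

precision, recall, f1 = cross_utterance_micro_f1(preds, golds)
print(precision, recall, f1)
```

In this toy case the uncross-utterance quad is dropped from both sets, so it neither helps nor hurts the score: one of the two remaining predictions matches the single remaining gold quad.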
Hello, there is an experiment on cross-utterance quadruples in the paper, and I noticed that it uses the micro-F1 metric.
I have identified the cross-utterance quads, and their number is consistent with Table 2 in the paper.
But I couldn't reproduce the scores in Fig. 6 of the paper. I'm confused about what pred and gold refer to, respectively, in the code. For example, when a dialog contains both cross- and uncross-utterance quadruples, should the uncross ones be included in the preds or golds?
Could you give me some guidance, or the evaluation code for calculating the cross-utterance micro-F1, please? I think it would also be helpful for others.
Thank you in advance!