Closed fairy-of-9 closed 2 years ago
Hi!
Do you mean the mentions deleted when converting spans to words?
If so, they are still deleted for word-level evaluation, but they are still there for span-level evaluation.
In the span-level evaluation, Would it be reasonable to deduce that its impossible to get 100% if one span is removed during conversion?
Thanks!
It would. The only spans that get deleted are the ones that have the same head words as some other spans. As the model can only predict one span per a head word, it is impossible for it to predict such spans, thus the evaluation result on CoNLL-12 can never be 1.0 on the span level
It's clear now! Thanks for ur comments!
Hello, I have a question about evaluation. Are deleted mentions included in the evaluation?
For example, when 2% of the mention is removed, is the upper bound of the result of
calcuate_conll.py
100%?Thanks!