Closed by benhachey 10 years ago
I skimmed back over the Hoffart (2011) paper: "We consider only mention-entity pairs where the ground-truth gives a known entity, and thus ignore roughly 20% of the mentions without known entity in the ground-truth."
We should address this when we add rank measures (see #10).
Hoffart et al. (2012) state that they follow the Hoffart et al. (2011) evaluation methodology.
So it's very good to have the Cornolti et al. numbers there, and the comparison should be correct, especially once mapping is up and running (#11).
It's nice to have the other numbers as well, with the caveat that there is no direct comparison until rank measures are added in a future release (#10).
I think the best way to handle this is to specify a subset of mentions for a filter operation. We may be able to use the means file.
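To make the filter idea concrete, here is a minimal sketch, not the project's implementation. It assumes annotations are dicts keyed by a mention span, that a mention with no known entity has the value `None`, and that plain accuracy stands in for whatever measure we end up reporting. The names `filter_to_known` and `accuracy` are hypothetical.

```python
from typing import Dict, Optional, Tuple

# Hypothetical representation: (doc_id, start, end) -> entity id, None = no known entity
Mention = Tuple[str, int, int]
Annotations = Dict[Mention, Optional[str]]


def filter_to_known(gold: Annotations, system: Annotations) -> Tuple[Annotations, Annotations]:
    """Keep only mentions whose gold annotation names a known entity."""
    keep = {m for m, entity in gold.items() if entity is not None}
    gold_kept = {m: gold[m] for m in keep}
    system_kept = {m: e for m, e in system.items() if m in keep}
    return gold_kept, system_kept


def accuracy(gold: Annotations, system: Annotations) -> float:
    """Fraction of gold mentions for which the system gives the gold entity."""
    if not gold:
        return 0.0
    correct = sum(1 for m, entity in gold.items() if system.get(m) == entity)
    return correct / len(gold)


# Toy data: 5 gold mentions, 1 of them (20%) without a known entity.
gold = {
    ("d1", 0, 5): "Paris",
    ("d1", 10, 15): None,
    ("d1", 20, 26): "France",
    ("d1", 30, 35): "Seine",
    ("d1", 40, 46): "Louvre",
}
system = {
    ("d1", 0, 5): "Paris",
    ("d1", 10, 15): "Paris_Hilton",
    ("d1", 20, 26): "France",
    ("d1", 30, 35): "Seine",
    ("d1", 40, 46): "Lens",
}

gold_known, system_known = filter_to_known(gold, system)
```

On this toy data, `accuracy(gold, system)` is 0.6 over all mentions and `accuracy(gold_known, system_known)` is 0.75 after filtering, which shows why numbers computed with and without the filter are not directly comparable.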
I'm trying to get TagMe2 output. Let's use that if we can.
I've added this to the README and to the reported results .csv file. What do you think?