Open benhachey opened 10 years ago
One of these should replicate the Hoffart MAP@1 evaluation (see #9).
We need to allow a confidence score in the system response to calculate this.
As discussed (#9), MAP needs to include disambiguation only setting which can be implemented as a data filter.
One of these should replicate the Hoffart MAP@1 evaluation (see #9).