Closed zhangyikaii closed 2 years ago
Hi, thanks for your interest. I will check it carefully and give a reply in a few days.
LEEP/NCE in the paper were calculated by historical code with bugs. Per the request of this issue, I gave an implementation of LEEP and NCE code. I fixed a bug by the way, and the LEEP/NCE code in this repo should work.
I forgot to update the results. New results are available here, calculated by the LEEP/NCE code in this repo.
Conclusions from both the paper and the updated results are the same: LEEP/NCE are quite unstable and the overall performance are inferior to LogME in many datasets.
I will make it clear in the documentation. Sorry for the inconvenience.
Thank you very much!
祝顺利.
Hi, thanks for your excellent work and friendly code style again. I encountered some difficulties in reproducing the NCE / LEEP results. Sorry, I am confused again.
I guess the calculation of NCE and LEEP scores in the paper may be wrong. Actually, I used the opposite of
pseudo_source_label
andtarget_label
to obtain the results of the paper, as shown in the code below:where the
predictions
variable is calculated fromoutput
, i.e.,The above wrong code successfully reproduces the results of NCE and LEEP in the paper, i.e., Section C. Original Results in Figures - Table 5. Original results in Figure 4.
From the NCE and LEEP code, the correct way to pass the
pseudo_source_label
andtarget_label
would be:But this does not yield the results in the paper. This will impact the calculation of the final weighted tau result.
Thanks again for your great work and open source spirit. I'm really sorry to come back to you for advice. Thank you very much.