Closed smallz2001 closed 2 months ago
Hi @sonyawong pls, help me !
I'm confusing with the same problem. I found the performances of DKT, DKVMN,AKT, SAKT, SAINT on other datasets are generally 5-10% higher than those in the original paper. I suspect it may be information leakage.
oh I found the answer here: https://github.com/pykt-team/pykt-toolkit/issues/144 It said that when using KC-level model, the metrics started with "window{metrics}late" shall prevail, because of information leakage
Hello, thank you for your contributions to the research on KT. In my investigation into why the AKT model achieves such high performance, it seems that there might be an issue of information leakage. When I utilized only the encoder part of the transformer within AKT, I observed a phenomenon of performance inflation. The specific practice is as follows:
The code
was replaced with:
This indicates that the masking is not successful, leading to the performance inflation of AKT, which suggests that the AKT model has knowledge of future information.