I ran the baseline codes for P12; however, the results show higher scores than the proposed method reported in the original paper.
The results I reproduced for mTAND were 85.1 AUROC and 52.4 AUPRC on the P12 dataset. In the paper, the performance of the proposed ViTST on P12 was reported as 85.1 AUROC and 51.1 AUPRC.
I also reproduced the transformer-mean but the AUROC and AUPRC scores were 85.3 and 51.7.
The Transformer-mean and mTAND AUPRC was higher than that of ViTST.
Is there anything I missed when reproducing the results?
Thank you for providing the interesting work!
I ran the baseline codes for P12; however, the results show higher scores than the proposed method reported in the original paper.
The results I reproduced for mTAND were 85.1 AUROC and 52.4 AUPRC on the P12 dataset. In the paper, the performance of the proposed ViTST on P12 was reported as 85.1 AUROC and 51.1 AUPRC.
I also reproduced the transformer-mean but the AUROC and AUPRC scores were 85.3 and 51.7.
The Transformer-mean and mTAND AUPRC was higher than that of ViTST.
Is there anything I missed when reproducing the results?
Thank you!
[Reproduced mTAND results]
[Reproduced Transformer-mean results]
[The paper results]
[Summarized results (P12)]
[Summarized results (P19)]