Double Correction Framework for Denoising Recommendation

The code and datasets of our KDD 2024 paper "Double Correction Framework for Denoising Recommendation"

Overview

We propose a Double Correction Framework for Denoising Recommendation (DCF), which contains two correction components from views of more precise sample dropping and avoiding more sparse data. In the sample dropping correction component, we use the loss value of the samples over time to determine whether it is noise or not, increasing dropping stability. Instead of averaging directly, we use the damping function to reduce the bias effect of outliers. Furthermore, due to the higher variance exhibited by hard samples, we derive a lower bound for the loss through concentration inequality to identify and reuse hard samples. In progressive label correction, we iteratively re-label highly deterministic noisy samples and retrain them to further improve performance. Finally, extensive experimental results on three datasets and four backbones demonstrate the effectiveness and generalization of our proposed framework.

Requirements

The model is implemented using PyTorch. The versions of packages used are shown below.

numpy==1.19.5
scikit-learn==0.24.2
torch==1.8.1
CUDA==10.2

Data Preparation

The three data source we use comes from Adressa , Yelp and MovieLens.

Special thanks

Very thanks to Dr.Wenjie Wang with his code DenoisingRec.

Quick run

python main.py

Citation

If you find this useful for your research, please kindly cite the following paper:

@article{he2024double,
  title={Double Correction Framework for Denoising Recommendation},
  author={He, Zhuangzhuang and Wang, Yifan and Yang, Yonghui and Sun, Peijie and Wu, Le and Bai, Haoyue and Gong, Jinqi and Hong, Richang and Zhang, Min},
  journal={arXiv preprint arXiv:2405.11272},
  year={2024}
}

bruno686 / DCF