This is the PyTorch implementation for AdaGCL proposed in the paper Adaptive Graph Contrastive Learning for Recommendation, which is accepted by KDD 2023.
Yangqin Jiang, Chao Huang, and Lianghao Xia. 2023. Adaptive Graph Contrastive Learning for Recommendation. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’23), August 6–10, 2023, Long Beach, CA, USA. ACM, New York, NY, USA, 10 pages.
SSLRec is a PyTorch-based open-source deep learning framework for recommender systems enhanced by self-supervised learning techniques. With SSLRec, you can easily compare our AdaGCL with other research works in the field of recommendation systems.
Graph neural networks (GNNs) have recently emerged as an effective collaborative filtering (CF) approaches for recommender systems. The key idea of GNN-based recommender systems is to recursively perform message passing along user-item interaction edges to refine encoded embeddings, relying on sufficient and high-quality training data. However, user behavior data in practical recommendation scenarios is often noisy and exhibits skewed distribution. To address these issues, some recommendation approaches, such as SGL, leverage self-supervised learning to improve user representations. These approaches conduct self-supervised learning through creating contrastive views, but they depend on the tedious trial-and-error selection of augmentation methods.
To fill the crucial gap, this work proposes a novel Adaptive Graph Contrastive Learning (AdaGCL) framework that conducts data augmentation with two adaptive contrastive view generators to better empower the CF paradigm. Specifically, it uses two trainable view generators - a graph generative model and a graph denoising model - to create adaptive contrastive views. With two adaptive contrastive views, AdaGCL introduces additional high-quality training signals into the CF paradigm, helping to alleviate data sparsity and noise issues. The overall framework of AdaGCL is given below.
.
├── README.md
├── AdaGCL.png
├── performance.png
├── Main.py
├── Model.py
├── Params.py
├── DataHandler.py
├── Utils
│ ├── TimeLogger.py
│ └── Utils.py
└── Datasets
├── beerAdvocate
│ ├── trnMat.pkl
│ ├── tstMat.pkl
│ └── valMat.pkl
├── lastFM
│ ├── trnMat.pkl
│ ├── tstMat.pkl
│ └── valMat.pkl
└── sparse_yelp
├── trnMat.pkl
├── tstMat.pkl
└── valMat.pkl
We develop our codes in the following environment:
- python==3.9.13
- numpy==1.23.1
- torch==1.11.0
- scipy==1.9.1
- torch-sparse==0.6.17
Dataset | # User | # Item | # Interaction | Interaction Density |
---|---|---|---|---|
Last.FM | 1,892 | 17,632 | 92,834 | 2.8 × |
Yelp | 42,712 | 26,822 | 182,357 | 1.6 × |
BeerAdvocate | 10,456 | 13,845 | 1,381,094 | 9.5 × |
The command lines to train AdaGCL on the three datasets are as below. The un-specified hyperparameters in the commands are set as default.
- Last.FM
python Main.py --data lastfm --gamma -0.95 --ib_reg 1e-2
- Yelp
python Main.py --data yelp --ssl_reg 1 --ib_reg 1e-2 --epoch 100
- BeerAdvocate
python Main.py --data beer --ib_reg 1e-2 --lambda0 1e-2 --ssl_reg 1
Performance comparison of baselines on different datasets in terms of Recall@20, NDCG@20, Recall@40, and NDCG@40:
If you find this work helpful to your research, please kindly consider citing our paper.
@inproceedings{jiang2023adaptive,
title={Adaptive Graph Contrastive Learning for Recommendation},
author={Jiang, Yangqin and Huang, Chao and Huang, Lianghao},
booktitle={Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining},
pages={4252--4261},
year={2023}
}