Skip to content

Commit

Permalink
Initial commit
Browse files Browse the repository at this point in the history
  • Loading branch information
downeykking committed Sep 15, 2023
0 parents commit c07999c
Show file tree
Hide file tree
Showing 315 changed files with 234,152 additions and 0 deletions.
160 changes: 160 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,160 @@
# Byte-compiled / optimized / DLL files
__pycache__/
*.py[cod]
*$py.class

# C extensions
*.so

# Distribution / packaging
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
share/python-wheels/
*.egg-info/
.installed.cfg
*.egg
MANIFEST

# PyInstaller
# Usually these files are written by a python script from a template
# before PyInstaller builds the exe, so as to inject date/other infos into it.
*.manifest
*.spec

# Installer logs
pip-log.txt
pip-delete-this-directory.txt

# Unit test / coverage reports
htmlcov/
.tox/
.nox/
.coverage
.coverage.*
.cache
nosetests.xml
coverage.xml
*.cover
*.py,cover
.hypothesis/
.pytest_cache/
cover/

# Translations
*.mo
*.pot

# Django stuff:
*.log
local_settings.py
db.sqlite3
db.sqlite3-journal

# Flask stuff:
instance/
.webassets-cache

# Scrapy stuff:
.scrapy

# Sphinx documentation
docs/_build/

# PyBuilder
.pybuilder/
target/

# Jupyter Notebook
.ipynb_checkpoints

# IPython
profile_default/
ipython_config.py

# pyenv
# For a library or package, you might want to ignore these files since the code is
# intended to run in multiple environments; otherwise, check them in:
# .python-version

# pipenv
# According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
# However, in case of collaboration, if having platform-specific dependencies or dependencies
# having no cross-platform support, pipenv may install dependencies that don't work, or not
# install all needed dependencies.
#Pipfile.lock

# poetry
# Similar to Pipfile.lock, it is generally recommended to include poetry.lock in version control.
# This is especially recommended for binary packages to ensure reproducibility, and is more
# commonly ignored for libraries.
# https://python-poetry.org/docs/basic-usage/#commit-your-poetrylock-file-to-version-control
#poetry.lock

# pdm
# Similar to Pipfile.lock, it is generally recommended to include pdm.lock in version control.
#pdm.lock
# pdm stores project-wide configurations in .pdm.toml, but it is recommended to not include it
# in version control.
# https://pdm.fming.dev/#use-with-ide
.pdm.toml

# PEP 582; used by e.g. github.com/David-OConnor/pyflow and github.com/pdm-project/pdm
__pypackages__/

# Celery stuff
celerybeat-schedule
celerybeat.pid

# SageMath parsed files
*.sage.py

# Environments
.env
.venv
env/
venv/
ENV/
env.bak/
venv.bak/

# Spyder project settings
.spyderproject
.spyproject

# Rope project settings
.ropeproject

# mkdocs documentation
/site

# mypy
.mypy_cache/
.dmypy.json
dmypy.json

# Pyre type checker
.pyre/

# pytype static type analyzer
.pytype/

# Cython debug symbols
cython_debug/

# PyCharm
# JetBrains specific template is maintained in a separate JetBrains.gitignore that can
# be found at https://github.com/github/gitignore/blob/main/Global/JetBrains.gitignore
# and can be added to the global gitignore or merged into this file. For a more nuclear
# option (not recommended) you can uncomment the following to ignore the entire idea folder.
#.idea/
160 changes: 160 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,160 @@
# Enhancing Graph Collaborative Filtering via Neighborhood Structure Embedding

*Xinzhou Jin, Jintang Li, Yuanzhen Xie, Liang Chen, Beibei Kong, Lei Cheng, Bo Hu, Zang Li, Zibin Zheng*

This is the official PyTorch implementation of our paper that has been accepted to 2023 IEEE International Conference on Data Mining (ICDM 2023). [[Paper]]()

<img src="./figure/fig.png" alt="fig" style="zoom:100%;" />



## Requirements:

```
python>=3.9.13
pytorch>=1.12.1
torch-geometric>=2.2.0
torch-sparse>=0.6.15+pt112cu116
numpy>=1.24.3
pandas>=1.5.0
CUDA 11.6
```



## Installation

```bash
pip install -r requirements.txt
# PyTorch and PyG depend on your own cuda environment. The following instructions are for CUDA11.6.
pip install torch==1.12.1+cu116 -f https://download.pytorch.org/whl/torch/
pip install torch-sparse==0.6.16 -f https://pytorch-geometric.com/whl/torch-1.12.1+cu116.html
pip install torch-scatter==2.1.0 -f https://pytorch-geometric.com/whl/torch-1.12.1+cu116.html
pip install torch-geometric
```



## Datasets

| Datasets | #Users | #Items | #Interactions | Density |
| -------- | ------- | ------ | ------------- | ------- |
| ML-1M | 6,039 | 3,628 | 836,478 | 0.03818 |
| Yelp | 45,477 | 30,708 | 1,777,765 | 0.00127 |
| Books | 58,144 | 58,051 | 2,517,437 | 0.00075 |
| Gowalla | 29,858 | 40,988 | 1,027,464 | 0.00084 |
| Alibaba | 300,000 | 81,614 | 1,607,813 | 0.00007 |

For `ml-1m` , `yelp`, `amazon-books`, `gowalla-merged`, they will be automatically downloaded via RecBole once you run the main program.

For `alibaba`, we provide it under `dataset/`

```
cd dataset
unzip alibaba.zip
```



## Implementation of NSE:

NSE is easy to implement as follows (PyTorch-style):

```python
def get_neighbor_adj(self):
from torch_sparse import SparseTensor
sp_adj = SparseTensor(row=self._user, col=self._item, value=torch.ones(
len(self._user)), sparse_sizes=(self.n_users, self.n_items))
return sp_adj


def get_ego_embeddings(self):
r""" Get the embedding of users and items and combine to an embedding matrix.
Returns:
Tensor of the embedding matrix. Shape of [n_items+n_users, embedding_dim]
"""
from torch_geometric.utils import spmm
user_embeddings = spmm(self.sp_adj, self.item_embedding.weight).to_dense()
item_embeddings = spmm(self.sp_adj.t(), self.user_embedding.weight).to_dense()
ego_embeddings = torch.cat([user_embeddings, item_embeddings], dim=0)

return ego_embeddings
```



## Reproduction

We integrate our NSE-LGCN method into the [RecBole](https://recbole.io/) and [RecoBole-GNN](https://github.com/RUCAIBox/RecBole-GNN) framework.

#### ML-1M


```
python run_recbole_gnn.py --dataset "ml-1m" --model "NSELightGCN" --n_layers 3 --reg_weight 0.0001 --learning_rate 0.0002
```

#### Yelp


```
python run_recbole_gnn.py --dataset "yelp" --model "NSELightGCN" --n_layers 3 --reg_weight 0.01 --learning_rate 0.0001
```

#### Amazon-books


```
python run_recbole_gnn.py --dataset "amazon-books" --model "NSELightGCN" --n_layers 3 --reg_weight 0.001 --learning_rate 0.0001
```

#### Gowalla


```
python run_recbole_gnn.py --dataset "gowalla-merged" --model "NSELightGCN" --n_layers 3 --reg_weight 0.0001 --learning_rate 0.0001
```

#### Alibaba


```
python run_recbole_gnn.py --dataset "alibaba" --model "NSELightGCN" --n_layers 3 --reg_weight 1e-06 --learning_rate 0.0001
```





If you want to run on the synthetic datasets, add `--ptb_strategy=replace` to the above commands. For example:

```
python run_recbole_gnn.py --dataset "ml-1m" --model "NSELightGCN" --n_layers 3 --reg_weight 0.01 --learning_rate 0.0001 --ptb_strategy=replace
```



## Acknowledgement

This repo is mainly based on [RecBole](https://recbole.io/) and [RecoBole-GNN](https://github.com/RUCAIBox/RecBole-GNN). Many thanks to their wonderful work!



## Citation

If you find this work is helpful to your research, please consider citing our paper:

```
@inproceedings{jin2023enhancing,
title={Enhancing Graph Collaborative Filtering via Neighborhood Structure Embedding},
author={Xinzhou Jin and Jintang Li and Yuanzhen Xie and Liang Chen and Beibei Kong and Lei Cheng and Bo Hu and Zang Li and Zibin Zheng},
booktitle={ICDM},
year={2023}
}
```



## Contact

If you have any questions about this work, please feel free to contact me via [email protected]
Binary file added dataset/alibaba.zip
Binary file not shown.
Binary file added figure/fig.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
5 changes: 5 additions & 0 deletions recbole/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
from __future__ import absolute_import
from __future__ import print_function
from __future__ import division

__version__ = "1.1.1"
1 change: 1 addition & 0 deletions recbole/config/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
from recbole.config.configurator import Config
Loading

0 comments on commit c07999c

Please sign in to comment.