YadaYuki / news-recommendation-llm

Pre-trained Large Language Model (BERT) Based News Recommendation using Python / PyTorch 🌎
MIT License

News Recommendation using LLM 🌎

Pre-trained Large Language Model Based News Recommendation using Python / PyTorch 🚀


Project Structure

The project is structured as follows.

$ tree -L 2
├── README.md
├── dataset/
│   └── download_mind.py
├── pyproject.toml
├── requirements-dev.lock
├── requirements.lock
├── src/
│   ├── config/
│   ├── const/
│   ├── evaluation/
│   ├── experiment/
│   ├── mind/
│   ├── recommendation/
│   │   └── nrms/
│   │       ├── AdditiveAttention.py
│   │       ├── NRMS.py
│   │       ├── PLMBasedNewsEncoder.py
│   │       ├── UserEncoder.py
│   │       ├── __init__.py
│   └── utils/
└── test/
    ├── evaluation/
    ├── mind/
    └── recommendation/




First, create a Python virtual environment and install the dependencies by running

$ rye sync

If the virtual environment was created successfully, a .venv/ folder appears at the project root.

Then set PYTHONPATH by running

$ export PYTHONPATH=$(pwd)/src:$PYTHONPATH

Download Microsoft News Dataset (MIND)

We use the Microsoft News Dataset (MIND) to train and validate the news recommendation model. You can download it by running dataset/download_mind.py.

$ rye run python ./dataset/download_mind.py

The script downloads the MIND archives from an external site and extracts them.

If the script succeeds, the dataset folder will be structured as follows:

├── download_mind.py
└── mind
    ├── large
    │   ├── test
    │   ├── train
    │   └── val
    ├── small
    │   ├── train
    │   └── val
    └── zip
        ├── MINDlarge_dev.zip
        ├── MINDlarge_test.zip
        ├── MINDlarge_train.zip
        ├── MINDsmall_dev.zip
        └── MINDsmall_train.zip
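
To give a sense of what the extracted data looks like, here is a minimal sketch of parsing one row of a behaviors.tsv file. It assumes the tab-separated layout described in the public MIND documentation (impression ID, user ID, time, click history, and impressions written as news IDs suffixed with -1 for clicked or -0 for not clicked). The names Behavior and parse_behavior_row and the sample row are illustrative and are not taken from this repository.

```python
from dataclasses import dataclass


@dataclass
class Behavior:
    impression_id: str
    user_id: str
    time: str
    history: list[str]     # news IDs the user clicked before this impression
    candidates: list[str]  # news IDs shown in this impression
    labels: list[int]      # 1 = clicked, 0 = not clicked


def parse_behavior_row(row: str) -> Behavior:
    """Parse one tab-separated row (assumed to have exactly 5 columns)."""
    impression_id, user_id, time, history, impressions = row.rstrip("\n").split("\t")
    candidates, labels = [], []
    for item in impressions.split():
        # Split "N10-1" into the news ID and its click label.
        news_id, _, label = item.rpartition("-")
        candidates.append(news_id)
        labels.append(int(label))
    return Behavior(impression_id, user_id, time, history.split(), candidates, labels)


# Made-up row in the assumed format.
sample_row = "1\tU100\t11/11/2019 9:05:58 AM\tN1 N2 N3\tN10-1 N11-0 N12-0"
behavior = parse_behavior_row(sample_row)
print(behavior.candidates, behavior.labels)
```

The loaders under src/mind/ may represent this data differently; the sketch only illustrates the raw file format.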


Fine Tune a model

Running src/experiment/train.py fine-tunes the news recommendation model on the MIND small dataset. Hyperparameters can be passed as command-line arguments.

$ rye run python src/experiment/train.py -m \
    random_seed=42 \
    pretrained="distilbert-base-uncased" \
    npratio=4 \
    history_size=50 \
    batch_size=16 \
    gradient_accumulation_steps=8 \
    epochs=3 \
    learning_rate=1e-4 \
    weight_decay=0.0 \
    max_len=30

The default value of each hyperparameter is defined in src/config/config.py. If you run the script without any arguments (rye run python src/experiment/train.py), fine-tuning starts with those defaults.
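
Two of these hyperparameters affect how training samples are built. Assuming the usual meaning of gradient accumulation, one optimizer step sees batch_size x gradient_accumulation_steps samples (16 x 8 = 128 in the command above). npratio is assumed here to be the number of non-clicked candidates sampled for each clicked one, as in the NRMS paper [3]. The sketch below illustrates that sampling scheme. The function sample_negatives is hypothetical and is not the repository's implementation.

```python
import random


def sample_negatives(
    labels: list[int], candidates: list[str], npratio: int, rng: random.Random
) -> list[list[str]]:
    """For each clicked candidate, build [positive, neg_1, ..., neg_npratio]."""
    positives = [c for c, y in zip(candidates, labels) if y == 1]
    negatives = [c for c, y in zip(candidates, labels) if y == 0]
    samples = []
    for pos in positives:
        if len(negatives) >= npratio:
            negs = rng.sample(negatives, npratio)  # without replacement
        else:
            # Too few negatives: sample with replacement to keep a fixed size.
            negs = rng.choices(negatives, k=npratio) if negatives else []
        samples.append([pos] + negs)
    return samples


batch_size, gradient_accumulation_steps = 16, 8
# Samples contributing to one optimizer step under gradient accumulation.
effective_batch_size = batch_size * gradient_accumulation_steps

rng = random.Random(42)
samples = sample_negatives(
    [1, 0, 0, 0, 0, 0], ["N1", "N2", "N3", "N4", "N5", "N6"], npratio=4, rng=rng
)
print(effective_batch_size, samples)
```

With npratio = 4, each training sample holds five candidates, and the model is trained to rank the clicked one first.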

Model Performance

We ran the fine-tuning code on a single GPU (V100 x 1) and evaluated the resulting models on the validation set of the MIND small dataset. As a baseline for comparison, we also implemented and evaluated random recommendation (src/experiment/evaluate_random.py).

Experimental Result

| Model | AUC | MRR | nDCG@5 | nDCG@10 | Time to Train |
| --- | --- | --- | --- | --- | --- |
| Random Recommendation | 0.500 | 0.201 | 0.203 | 0.267 | - |
| NRMS + DistilBERT-base | 0.674 | 0.297 | 0.322 | 0.387 | 15.0 h |
| NRMS + BERT-base | 0.689 | 0.306 | 0.336 | 0.400 | 28.5 h |
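
For readers unfamiliar with these metrics, the following is a minimal sketch of how AUC, MRR, and nDCG@k can be computed for a single impression. It uses common textbook definitions, and the function names are illustrative. The code under src/evaluation/ may differ in details such as tie handling, and reported values are typically averaged over all impressions.

```python
import math


def auc(labels, scores):
    """Fraction of (clicked, non-clicked) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def _ranked_labels(labels, scores):
    """Labels reordered by descending predicted score."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [labels[i] for i in order]


def mrr(labels, scores):
    """Mean of reciprocal ranks over the clicked items."""
    ranked = _ranked_labels(labels, scores)
    return sum(y / (rank + 1) for rank, y in enumerate(ranked)) / sum(ranked)


def dcg_at_k(ranked, k):
    """Discounted cumulative gain over the top-k positions."""
    return sum((2**y - 1) / math.log2(rank + 2) for rank, y in enumerate(ranked[:k]))


def ndcg_at_k(labels, scores, k):
    """DCG@k normalized by the DCG@k of an ideal ranking."""
    ideal = dcg_at_k(sorted(labels, reverse=True), k)
    return dcg_at_k(_ranked_labels(labels, scores), k) / ideal


# Toy impression: two clicked items out of four candidates.
labels = [0, 1, 0, 1]
scores = [0.1, 0.9, 0.8, 0.3]
print(auc(labels, scores), mrr(labels, scores), ndcg_at_k(labels, scores, 5))
```

Under this definition a uniformly random scorer has an expected AUC of 0.5, which matches the Random Recommendation row.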

Trained Model

To make it easy to try inference and evaluation, we have publicly released the trained model. Here are the links.

| Model | Link |
| --- | --- |
| NRMS + DistilBERT-base | Google Drive |
| NRMS + BERT-base | Google Drive |

You can try it with the following script.

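import torch
from recommendation.nrms import NRMS, PLMBasedNewsEncoder, UserEncoder

# Assumptions not given in the original snippet: 768 is the hidden dimension of
# DistilBERT-base, and the device is selected automatically.
hidden_size = 768
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

loss_fn: torch.nn.Module = torch.nn.CrossEntropyLoss()
pretrained = "distilbert-base-uncased"
news_encoder = PLMBasedNewsEncoder(pretrained)
user_encoder = UserEncoder(hidden_size=hidden_size)
nrms_net = NRMS(news_encoder=news_encoder, user_encoder=user_encoder, hidden_size=hidden_size, loss_fn=loss_fn).to(
    device, dtype=torch.bfloat16
)
path_to_model = "{path to trained NRMS-DistilBERT Model}"  # replace with the downloaded file path
# Assumes the released file is a state_dict checkpoint.
nrms_net.load_state_dict(torch.load(path_to_model, map_location=device))
nrms_net.eval()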




[1] BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. https://aclanthology.org/N19-1423

[2] DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. Sanh, V., Debut, L., Chaumond, J., & Wolf, T. https://arxiv.org/abs/1910.01108

[3] Neural News Recommendation with Multi-Head Self-Attention. Wu, C., Wu, F., Ge, S., Qi, T., Huang, Y., & Xie, X. https://aclanthology.org/D19-1671

[4] Empowering News Recommendation with Pre-Trained Language Models. Wu, C., Wu, F., Qi, T., & Huang, Y. https://doi.org/10.1145/3404835.3463069

[5] MIND: A Large-scale Dataset for News Recommendation. Wu, F., Qiao, Y., Chen, J.-H., Wu, C., Qi, T., Lian, J., Liu, D., Xie, X., Gao, J., Wu, W., & Zhou, M. https://aclanthology.org/2020.acl-main.331

Using This Software in Your Work

We warmly welcome the use of this software in academic, research, or educational settings. If you decide to use our software in your paper or project, we would appreciate it if you could inform us. This helps us understand the impact and reach of our software and can guide future development and improvements.

Please contact us at: yada.yuki@fuji.waseda.jp


If you refer to or use our software in your research, please cite it as follows:

  author = {Yuki Yada},
  title = {News Recommendation using PLMs},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/YadaYuki/news-recommendation-llm}}