StreamingDialogue: Prolonged Dialogue Learning via Long Context Compression with Minimal Losses

This repository is the official implementation of "StreamingDialogue: Prolonged Dialogue Learning via Long Context Compression with Minimal Losses". The code is built on LLaMA-Factory.

Requirements

To install requirements:

pip install -r requirements.txt

Training

To train the model(s) in the paper, run the commands for each stage below:

SMR&LMR

cd src
./train.sh
./merge.sh

Supervised Learning

Before training, edit two source files.

In /src/llmtuner/train/sft/workflow.py:
- Comment out lines 157-240.
- Uncomment lines 83-94.

In /model/llama/modeling_llama.py:
- Comment out lines 1049-1052.
- Uncomment lines 1029-1032.

cd src
./train.sh
./merge.sh

Evaluation

Generate

cd src
python generate.py

Inference

cd src
python infer.py

Evaluate

Evaluate PPL

cd src
python eval_ppl.py --base_model /model/strdialogue-merge --eval_path /data/test_msc.json
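
For reference, perplexity (PPL) is the exponential of the average per-token negative log-likelihood. The sketch below only illustrates that definition in plain Python. It is not the code in eval_ppl.py, and the function name `perplexity` and the sample values are hypothetical.

```python
import math


def perplexity(token_nlls):
    """Return exp(mean negative log-likelihood) over a sequence of tokens.

    token_nlls: per-token negative log-likelihoods in nats (natural log).
    This is an illustrative helper, not part of this repository.
    """
    if not token_nlls:
        raise ValueError("need at least one token")
    return math.exp(sum(token_nlls) / len(token_nlls))


# Hypothetical example: a model that assigns probability 0.25 to each of 4 tokens.
nlls = [-math.log(0.25)] * 4
print(perplexity(nlls))  # approximately 4: as uncertain as a uniform choice among 4 tokens
```

Lower values mean the model assigns higher probability to the reference text.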

Evaluate other metrics

cd src
python compute_score.py

Pre-trained Models

You can download the pre-trained models here:

JinaLeejnl / StreamingDialogue
