The official code for the NAACL paper: Predicting Discourse Trees from pre-trained Transformer-based Neural Summarizers
python==3.6.7
transformers==4.3.2
torch==1.6.0
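One way to set up a matching environment (a sketch assuming conda and pip are available; the environment name summ_disco is just a placeholder):

conda create -n summ_disco python=3.6.7
conda activate summ_disco
pip install transformers==4.3.2 torch==1.6.0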
The models we used in this paper can be downloaded here: link
You can also train your own summarizer using the code here, with self-attention as the attention type.
Two of the three datasets are not open source; you can find them here:
a. RST-DT
c. GUM
Before you can use the summarization model to parse the documents, you first need to pre-process the datasets into the format shown in the example.
We use a Jupyter notebook to pre-process the data; you can find everything you need in Data Processing & Result Analysis.ipynb.
The processed GUM data can be found in the data folder.
Use build_gt_tree.py to generate the ground-truth constituency and dependency trees.
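For example, the following invocation should work if the script's defaults are sufficient; check build_gt_tree.py for any arguments it expects:

python3 build_gt_tree.py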
To derive constituency trees, simply specify the number of layers and heads, as well as the model name and the GPU to use:
python3 main_const.py -n_layers 6 -n_head 8 -device 0 -model_name cnndm-6-8
To derive dependency trees, simply specify the number of layers and heads, as well as the model name and the GPU to use:
python3 main_dep.py -n_layers 6 -n_head 8 -device 0 -model_name cnndm-6-8
To run the baselines:
python3 main_bsl.py