ratishsp / data2text-macro-plan-py

Code for TACL 2021 paper on Data-to-text Generation with Macro Planning
MIT License
22 stars 10 forks source link
data-to-text-generation deep-learning natural-language-generation

data2text-macro-plan-py PWC PWC

This repo contains code for Data-to-text Generation with Macro Planning (Ratish Puduppully and Mirella Lapata; In Transactions of the Association for Computational Linguistics (TACL)); this code is based on an earlier (version 0.9.2) fork of OpenNMT-py.

Citation

@article{puduppully-2021-macro,
  author    = {Ratish Puduppully and Mirella Lapata},
  title     = {Data-to-text Generation with Macro Planning},
  journal = {Transactions of the Association for Computational Linguistics},
  year      = {2021},
  url       = {https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00381/101876/Data-to-text-Generation-with-Macro-Planning},
  pages = {510--527},
  volume = {9},
}

Requirements

All dependencies can be installed via:

pip install -r requirements.txt

Code Details

The main branch contains code to generate macro plans from input verbalization. The code for training summary generation is in summary_gen branch for MLB and summary_gen_roto for RotoWire dataset.

The test outputs and trained models can be downloaded from the google drive link https://drive.google.com/drive/folders/1jJjq5IvuBKNLTAe7fuwlDYParrxpK-WD?usp=sharing

The steps for training and inference for RotoWire dataset are given in README_RotoWire, and for MLB dataset are given in README_MLB.