howardyclo / papernotes

My personal notes and surveys on DL, CV and NLP papers.
128 stars 6 forks source link

A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction #9

Open howardyclo opened 6 years ago

howardyclo commented 6 years ago

Metadata

Authors: Shamil Chollampatt and Hwee Tou Ng1 Organization: National University of Singapore Release Date: 2018 on Arxiv Link: https://arxiv.org/pdf/1801.08831.pdf

howardyclo commented 6 years ago

Summary

This paper is the first group to successfully employ a fully convolutional Seq2Seq (Conv Seq2Seq) model (Gehring et al., 2017) on grammatical error correction (GEC) with some improvements: BPE tokenization, pre-training fastText on large native English corpora, pre-training auxiliary N-gram language model as re-scorer and incorporating task-specific features. The experiment results outperform strong statistical machine translation baseline and all previous neural-based approaches on this case in terms of CoNLL-2014 (grammaticality) and JFLEG dataset (fluency). Additionally, the attention visualization of LSTM-based and Conv-based model shows that LSTM-based model tends to focus on matching source words as opposed to Conv-based model which tends to focus on surrounding context words.


Highlights of Technical Detail


Experiment Settings


Experiment Results

Table 1

Table 2 & 3

Notes on Table 3:

Figures

Notes on attention visualization:


Insights


Future Work


References