General question: GEC data preprocessing

nusnlp / mlconvgec2018

Code and model files for the paper: "A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction" (AAAI-18).

GNU General Public License v3.0

185 stars 73 forks source link

General question: GEC data preprocessing #34

Closed aimanmutasem closed 4 years ago

aimanmutasem commented 4 years ago

Dear @all,

I'm sorry this is not an issue, it just a general question about GEC data pre-processing.

I'm a little confused about the standard GEC dataset format (error-annotated data .M2 format), How we can use the correction labels on the target side to improve the GEC model? instead of release it and feed it to the model as the pure parallel dataset.