Dear @all,
Sorry, this is not an issue, just a general question about GEC data pre-processing.
I'm a little confused about the standard GEC dataset format (error-annotated data in the .M2 format). How can we use the correction labels on the target side to improve the GEC model, instead of discarding them and feeding the data to the model as a purely parallel dataset?
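
To make the question concrete, here is a minimal sketch of the two options as I understand them. It assumes the usual M2 layout (an `S` line with the tokenized source, followed by `A start end|||type|||correction|||...|||annotator_id` lines with token offsets). The function names `parse_m2_block`, `apply_edits`, and `token_labels` are hypothetical helpers written for this example, not part of any toolkit.

```python
# One M2 block: a source sentence plus one annotated edit (illustrative data).
EXAMPLE = """S This are a sentence .
A 1 2|||R:VERB:SVA|||is|||REQUIRED|||-NONE-|||0"""


def parse_m2_block(block, annotator=0):
    """Return (source_tokens, edits) for one annotator of one M2 block."""
    lines = block.strip().split("\n")
    source = lines[0][2:].split()  # drop the leading "S "
    edits = []
    for line in lines[1:]:
        if not line.startswith("A "):
            continue
        fields = line[2:].split("|||")
        start, end = map(int, fields[0].split())
        err_type, correction, ann_id = fields[1], fields[2], int(fields[-1])
        # Skip other annotators and "noop" lines (sentence needs no edit).
        if ann_id != annotator or err_type == "noop":
            continue
        edits.append((start, end, err_type, correction))
    return source, edits


def apply_edits(source, edits):
    """Option 1: discard the labels and keep only the corrected sentence."""
    target = list(source)
    # Apply right to left so earlier token offsets stay valid.
    for start, end, _, correction in sorted(edits, reverse=True):
        target[start:end] = [] if correction == "-NONE-" else correction.split()
    return target


def token_labels(source, edits):
    """Option 2: keep the labels as one tag per source token.

    Insertions (start == end) cover no source token, so this simple
    scheme does not tag them.
    """
    labels = ["KEEP"] * len(source)
    for start, end, err_type, _ in edits:
        for i in range(start, end):
            labels[i] = err_type
    return labels


src, edits = parse_m2_block(EXAMPLE)
print(" ".join(apply_edits(src, edits)))  # This is a sentence .
print(token_labels(src, edits))  # ['KEEP', 'R:VERB:SVA', 'KEEP', 'KEEP', 'KEEP']
```

`apply_edits` gives the plain parallel pair (source, corrected target), which is what I am doing now. `token_labels` is one possible way to keep the annotation, for example as an auxiliary tagging target, but I am not sure whether this is how the labels are normally used.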