Closed 1251480932 closed 6 months ago
Hi,
The original data for the paper is no longer available: it was removed during a system transfer.
For guidance on dataset creation, you can consider the following steps:
We provide a sample dataset in toy_data. Specifically, the "training" directory contains the source texts, target texts, and parsing results (step 1), whereas the "training_triplets" directory contains the final syntax-aware sequence-to-sequence data used for training (step 3).
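To get a feel for the sample before processing your own data, it may help to list what each directory contains. The sketch below is only an illustration, not part of the repository: it assumes "training" and "training_triplets" sit directly under toy_data in your local checkout and that the files are plain text. The helper name `summarize_dir` is hypothetical, and the actual file names and formats are not stated here, so check them against what the script prints.

```python
from pathlib import Path


def summarize_dir(root):
    """Return {relative file path: line count} for every file under root."""
    root = Path(root)
    summary = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            # errors="replace" keeps the count going if a file is not valid UTF-8.
            with path.open(encoding="utf-8", errors="replace") as handle:
                summary[str(path.relative_to(root))] = sum(1 for _ in handle)
    return summary


if __name__ == "__main__":
    # Assumed layout: toy_data/training and toy_data/training_triplets
    # relative to the current working directory.
    for name in ("training", "training_triplets"):
        directory = Path("toy_data") / name
        if not directory.is_dir():
            print(f"{directory}: not found")
            continue
        for rel_path, n_lines in summarize_dir(directory).items():
            print(f"{directory / rel_path}: {n_lines} lines")
```

If the texts are stored one example per line, matching line counts within each directory would be a quick sanity check to run on your own data as well.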
Dear GitHub author, would it be possible for you to make your dataset publicly available? Also, could you provide some guidance on how to create a dataset if I wish to process my own data?