Closed zerogerc closed 2 years ago
@zerogerc Hi, the data used for training the model is a combination of training datasets for 12 languages selected from UD2.3. Then the model is evaluated on their respective dev/test sets.
@yzhangcs hi, but there are several treebanks for each language. Like ewt, gum or atis for english.
@zerogerc I just follow this paper for data preprocessing, and here is their released data. Note that UD2.2 treebanks were adopted by the CoNLL18 shared task. Here I use UD2.3 instead.
I see, thanks for the help!
Hi, I've read the following in the README:
How could I know which treebanks were used for training and evaluation?