Closed zhou-zh closed 6 years ago
hello~
i understand that you want to train other language model. if then, you can check this issue.
SRC_CORPUS_DIR=${CDIR}/UD_English
TRAIN_FILE=${DATA_DIR}/en-ud-train.conllu.conv
DEV_FILE=${DATA_DIR}/en-ud-dev.conllu.conv
you can check https://github.com/tensorflow/models/issues/1211#issuecomment-287744105
@dsindex , thanks for your reply ! If i hope change a language to trian, I should just modify the path to the data set for corresponding language ? We do not need to use the different models provided by the CoNLL2017 baselines guide ? I thought that different lanuage models have different word-map.
@continuesmile yes~ place a corpus to the path and modify script for training your own model.
the models provided by the CoNLL2017 baselines guide were trained by https://github.com/tensorflow/models/tree/master/syntaxnet/dragnn/tools
those script are the original one. mine is modified version for convenience.
Hi @dsindex,
Should I train the segmentation by myself ? I trained the model with UD Chinese Corpus, but the UAS, LAS only 68.36%, 58.96%, much worse than baseline. Do you have some hint ?
Thanks again
Thanks for your great works!I saw your reply on stackoverflow, i know you have built your own system, i have two problemsa about it: