Open jwijffels opened 3 years ago
Hi @jwijffels That is very helpful to me as I am in the process of improving the lemmatization with this tool and Pie. I made a separate model for the tagger and the lemmatiser although it is not yet in the repository. There are certainly some steps to be improved, starting with the split in a dev set. Many thanks for your advices !
I was just training a udpipe lemmatiser myself on dutch and browsed a bit github for udpipe_train and saw this. I'm the author of the udpipe R package and happen to work on some 18th-19th century texts myself (dutch / french). Let me give some advice to make the udpipe model better, in order of importance