vncorenlp / VnCoreNLP

A Vietnamese natural language processing toolkit (NAACL 2018)

Retrain POS and NER models for data with wrong capital letters? #1 #7

Closed: doannam020293 closed this issue 6 years ago

doannam020293 commented 6 years ago

I want to predict POS tags, but my data contains syllables with wrong capitalization, so the results are not good. I would like to retrain the model either without the "capital word" feature or with additional lowercased data added to the training set. What do you think?

datquocnguyen commented 6 years ago

I think it's a possible solution, worth trying on your data. Good luck with that!
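
For readers who want to try the second idea from the question (adding lowercased data to the training set), here is a minimal sketch. It is not part of VnCoreNLP: the function name `lowercase_augment`, the in-memory corpus layout (a list of sentences, each a list of `(word, tag)` pairs), and the sample tags are all assumptions made for illustration. Reading and writing the actual training files in whatever format the trainer expects is left out.

```python
import random


def lowercase_augment(sentences, ratio=1.0, seed=0):
    """Return the original sentences plus lowercased copies of some of them.

    `sentences` is a list of sentences, each a list of (word, tag) pairs
    (an assumed layout, not VnCoreNLP's own file format). Tags are kept
    unchanged; only the word forms are lowercased. `ratio` is the
    probability that a given sentence gets a lowercased copy.
    """
    rng = random.Random(seed)
    augmented = list(sentences)
    for sent in sentences:
        if rng.random() < ratio:
            lowered = [(word.lower(), tag) for word, tag in sent]
            # Skip sentences that are already fully lowercase,
            # so the training set does not gain exact duplicates.
            if lowered != sent:
                augmented.append(lowered)
    return augmented


# Hypothetical toy corpus; the tags are illustrative only.
corpus = [
    [("Hà_Nội", "Np"), ("là", "V"), ("thủ_đô", "N")],
    [("tôi", "P"), ("đi", "V"), ("học", "V")],
]
augmented = lowercase_augment(corpus)
```

Keeping the original sentences alongside the lowercased copies lets the retrained tagger still see correct capitalization while learning not to rely on it; whether this helps on a given data set would need to be checked on held-out data.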