shunjizhan / Acrobat-Portal

Acrobat web portal for users to more conveniently interact with Acrobat project
2 stars 3 forks source link

Improve ner model #22

Open shunjizhan opened 5 years ago

shunjizhan commented 5 years ago

Such as: https://sklearn-crfsuite.readthedocs.io/en/latest/tutorial.html#hyperparameter-optimization

shunjizhan commented 5 years ago

when training, we used sentence.split() instead of word_tokenize() from nltk, since this will cause some little bug in training, but in the future, use tokenize to train might get better result.

(actually I think this doesn't matter)