Closed gui-li closed 5 years ago
When I trained the lstm_crf model only with Phone_number a and email tag, the recall and precision is not 0. But I want to predict all labels together at once so that I can compare it with other model.
Yes, it's the problem of word embedding. Some of the out-of-vocabulary words will be set to tensor of 0 and they will likely be ignored by the network while the word embedding trainning option is set to False. However, there is some solutions here:
I tried to run my own dataset with your lstm_crf code, but some of the tags got 0 precision and 0 recall. Specifically, it's the phone_number set and email set. So I was wondering what happen. Is that the problem of embedding? Cause I noticed phone numbers and emails are not in the glove.840B.300d.txt embedding vectors. However, what should I do to effectively train these two tags instead of 0? Thank you for your help ahead.