Open OYE93 opened 5 years ago
I ran the code with both crf_loss and softmax_loss, using the default parameters in each case. The results are as follows:
--crf=False
--crf=True
Hey, I'd like to understand the dataset. The first row is the word and the fourth row is the label; what do the second and third rows mean? Another question: in the output label_test.txt, the second and third rows are the same. Does that have some other meaning?
Hi, I think you mean columns. My guess is that the 2nd column is the part-of-speech (POS) tag and the 3rd column is the word segmentation. In label_test.txt, the 3rd column should be the predicted tags, so the 2nd and 3rd columns will not necessarily be the same. You can compare the 2nd and 3rd columns to evaluate the predictions.
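For anyone who wants to do that comparison, here is a minimal sketch. It assumes each non-empty line of label_test.txt is whitespace-separated as `token reference_tag predicted_tag`, which is my reading of the file and not something confirmed by the repo. `token_accuracy` is a hypothetical helper name, not part of BERT_NER.py.

```python
def token_accuracy(lines):
    """Fraction of tokens whose 2nd column (assumed reference tag)
    equals the 3rd column (assumed predicted tag)."""
    correct = 0
    total = 0
    for line in lines:
        parts = line.split()
        if len(parts) < 3:
            continue  # skip blank lines and anything without three columns
        total += 1
        if parts[1] == parts[2]:
            correct += 1
    return correct / total if total else 0.0


# Made-up sample lines in the assumed three-column layout.
sample = [
    "EU B-ORG B-ORG",
    "rejects O O",
    "German B-MISC O",
    "",
    "call O O",
]
print(token_accuracy(sample))
```

On a real file you could pass the open file object directly, e.g. `token_accuracy(open("label_test.txt"))`. Keep in mind that token accuracy is inflated by the many `O` tags; entity-level precision, recall, and F1 are the usual NER metrics.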
Thank you, I got it. Another question: if my dataset doesn't have the 2nd column (POS) or the 3rd column (word segmentation), can this model still run? I ask because in BERT_NER.py I don't see any handling of the 2nd and 3rd columns.
Of course. Only the 1st and 4th columns are necessary. Just convert your data to a two-column format, and then you can use the code for training and testing.
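A rough sketch of that conversion, in case it helps. It assumes the input is whitespace-separated with four columns per token line and blank lines between sentences; `to_two_columns` is a hypothetical helper, not something from the repo. It is worth checking how the data reader in BERT_NER.py splits each line before relying on this.

```python
def to_two_columns(lines):
    """Keep only the 1st (word) and 4th (label) column of each 4-column line."""
    out = []
    for line in lines:
        parts = line.split()
        if len(parts) >= 4:
            out.append(f"{parts[0]} {parts[3]}")
        else:
            # Blank sentence separators and shorter lines pass through unchanged.
            out.append(line.rstrip("\n"))
    return out


# Made-up sample in the assumed four-column layout.
sample = [
    "EU NNP B-NP B-ORG\n",
    "rejects VBZ B-VP O\n",
    "\n",
    "Peter NNP B-NP B-PER\n",
]
print("\n".join(to_two_columns(sample)))
```

Going the other way (you only have words and labels) needs no conversion at all, since your data is already in the two-column form.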
Thank you very much!!!
:)
Excuse me, I have another question. Where does label_test.txt come from? Is it man-made or machine-generated?
Did this program run successfully?
label_test.txt is generated, and the run seems successful. Now you can run this on your own dataset.
OK, I ran it on my own data, but I have a problem, shown in the picture: the left is the author's data, the right is mine.
I just found out that crf=False does not work. You always use the CRF layer.
I raised an issue:
Hello, thanks for your work. I have a question about the different loss functions: is there any difference in performance between them, for example crf_loss versus softmax_loss? Thanks.
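To make the question concrete, here is a small pure-Python sketch of what the two kinds of loss generally compute. This is not the repo's implementation: the function names and the `emissions` / `transitions` inputs are illustrative assumptions, and the CRF here is simplified (no start or end transition scores). A softmax loss scores each token independently, while a linear-chain CRF loss scores the whole tag sequence and adds learned tag-to-tag transition scores.

```python
import math


def logsumexp(xs):
    """Numerically stable log(sum(exp(x) for x in xs))."""
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs))


def softmax_loss(emissions, tags):
    """Sum of per-token cross-entropies; tokens are treated independently."""
    return sum(logsumexp(scores) - scores[t] for scores, t in zip(emissions, tags))


def crf_loss(emissions, transitions, tags):
    """Negative log-likelihood of the full tag sequence under a linear-chain CRF."""
    # Score of the given tag path: emission scores plus transition scores.
    path = emissions[0][tags[0]]
    for i in range(1, len(tags)):
        path += transitions[tags[i - 1]][tags[i]] + emissions[i][tags[i]]
    # Forward algorithm: log of the summed scores of all possible paths.
    n_tags = len(emissions[0])
    alpha = list(emissions[0])
    for scores in emissions[1:]:
        alpha = [
            logsumexp([alpha[p] + transitions[p][c] for p in range(n_tags)]) + scores[c]
            for c in range(n_tags)
        ]
    return logsumexp(alpha) - path


# Made-up scores: 3 tokens, 3 tags.
emissions = [[2.0, 0.5, 0.1], [0.3, 1.5, 0.2], [0.1, 0.4, 1.8]]
tags = [0, 1, 2]
zero = [[0.0] * 3 for _ in range(3)]
penalized = [row[:] for row in zero]
penalized[0][1] = -5.0  # discourage the 0 -> 1 transition used by `tags`

print(softmax_loss(emissions, tags))
print(crf_loss(emissions, zero, tags))
print(crf_loss(emissions, penalized, tags))
```

With all transition scores set to zero the two losses coincide, so any difference comes from the transition scores the CRF learns. Whether that helps in practice on this model is an empirical question that the sketch does not answer.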