Closed zongxindu closed 2 years ago
I was really inspired by your insight into nested entity recognition, but I have a question: why didn't the paper try using a large pre-trained model (BERT) for the token representations?