thunlp / NRE

Neural Relation Extraction, including CNN, PCNN, CNN+ATT, PCNN+ATT
MIT License
810 stars 309 forks source link

First word match could be not right entity mention. #7

Closed serhii-havrylov closed 7 years ago

serhii-havrylov commented 7 years ago

From this line of code, It seems that the first match between a head(tail) entity and a word is treated as right entity mention in the sentence. But in the case when a sentence has several mentions of the entity it is not necessarily true.

head: brooklyn
tail: eastern parkway
sentence: brooklyn museum , 200 eastern parkway , brooklyn , (718) 638-5000 .

Original dataset contains necessary index information, but it seems that preprocessed data in this repo doesn't have it.

Please correct me if I am wrong.

Mrlyk423 commented 7 years ago

We find that the index information have no influence of the final performance in this dataset. Hence, in this code, we just ignore it. Thanks.