dmis-lab / BioSyn

ACL'2020: Biomedical Entity Representations with Synonym Marginalization
https://arxiv.org/abs/2005.00239
MIT License
160 stars 26 forks source link

Composite identifier predictions meaning #15

Closed sanyabt closed 10 months ago

sanyabt commented 1 year ago

Hi, thank you so much for your work!

I have a question about the composite predictions from the model - in your README example, the model predicts multiple identifiers "D001260|208900" in the top 5. Does this mean that the model found both terms to be similar to the mention text based on probabilities or it is providing alternate possible identifiers? Just want to understand what predicting multiple identifiers implies in BioSyn predictions. Thanks!

mjeensung commented 1 year ago

Hi @sanyabt

Thanks for reaching out to us. We are using the MEDIC dictionary for disease.

If you take a look at the site, it provides 'DiseaseID' as well as 'AltDiseaseIDs'. So, 'D001260|208900' can be interpreted that 'D001260' is a 'DiseaseID' and '208900' is a 'AltDiseaseIDs'.