The Hoffmann et al. Cell paper was one of the first publications about SARS-CoV-2 that mentioned human proteins associated with the infection. The abstract of the paper contains sentences that should be picked by EPMC's text mining algorithm, for instance:
Here, we demonstrate that SARS-CoV-2 uses the SARS-CoV receptor ACE2 for entry and the serine protease TMPRSS2 for S protein priming
Here, ACE2 and TMPRSS2 should be tagged as targets and SARS-CoV-2 as the disease (it's a synonym of COVID-19-MONDO_0100096). It would also be acceptable if SARS-CoV was identified and annotated as Severe acute respiratory syndrome - EFO_0000694, which would be a false positive. However, this is not the case although there are four target-disease associations extracted from this paper:
@saha-shyamasree has been looking into it and she couldn't find any obvious explanations for it but she thinks that there is something wrong in the target annotation. It is important to fix this issue because we may be missing many more similar associations.
The Hoffmann et al. Cell paper was one of the first publications about SARS-CoV-2 that mentioned human proteins associated with the infection. The abstract of the paper contains sentences that should be picked by EPMC's text mining algorithm, for instance:
Here,
ACE2
andTMPRSS2
should be tagged as targets andSARS-CoV-2
as the disease (it's a synonym ofCOVID-19
-MONDO_0100096). It would also be acceptable ifSARS-CoV
was identified and annotated asSevere acute respiratory syndrome
- EFO_0000694, which would be a false positive. However, this is not the case although there are four target-disease associations extracted from this paper:@saha-shyamasree has been looking into it and she couldn't find any obvious explanations for it but she thinks that there is something wrong in the target annotation. It is important to fix this issue because we may be missing many more similar associations.