LHNCBC / metamaplite

A near real-time named-entity recognizer
https://metamap.nlm.nih.gov/MetaMapLite.shtml
Other
58 stars 14 forks source link

Semantic Type/Source filtering happens after longest match exlcuding matches #9

Open dconlan opened 4 years ago

dconlan commented 4 years ago

If the string 'Metastatic prostate cancer' is processed with the semantic type of bpoc (Body Part, Organ or Organ Component) then no concepts are found.

This is because 'Metastatic prostate cancer' matches concept C1282496 and C0936223 but these are both neop (Neoplastic Process) concepts and are discarded when the semantic type filtering is applied.

I would have expected the longest match to be applied after the semantic type and source filtering.

i.e. the concept C0033572: prostate should have been returned.