monarch-initiative / mondo

Mondo Disease Ontology
http://obofoundry.org/ontology/mondo
Creative Commons Attribution 4.0 International

Investigate why some diseases match to multiple patterns #2273

Open matentzn opened 3 years ago

matentzn commented 3 years ago

See this comment as an example:

https://github.com/monarch-initiative/mondo/pull/2268#issue-517185148

nicolevasilevsky commented 3 years ago

inflammatory_disease_by_site.yaml is the correct pattern for the example in #2268

specific_infectious_disease_by_location: the variable's range is MONDO:0000001 'disease'; it should probably be more specific, like MONDO:0005550 'infectious disease'.

specific_inflammatory_disease_by_site.yaml: the variable's range is MONDO:0000001 'disease'; it should probably be more specific too, like MONDO:0021166 'inflammatory disease'.
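A rough sketch of why the broad range invites overlap, assuming a pattern variable accepts any class that is equal to, or a subclass of, its declared range. The three-class hierarchy is a simplified toy built from the IDs quoted in this comment (the real Mondo hierarchy has intermediate classes), and `in_range` is a hypothetical helper, not part of any pattern tooling:

```python
# Toy is-a hierarchy (child -> parent); simplified, for illustration only.
PARENT = {
    "MONDO:0005550": "MONDO:0000001",  # infectious disease -> disease
    "MONDO:0021166": "MONDO:0000001",  # inflammatory disease -> disease
}

def in_range(term, range_cls):
    """Return True if term equals range_cls or is a descendant of it."""
    while term is not None:
        if term == range_cls:
            return True
        term = PARENT.get(term)  # walk up one level; None at the root
    return False

filler = "MONDO:0021166"  # inflammatory disease

# With the range set to 'disease', the infectious-disease pattern's
# variable also accepts an inflammatory-disease filler.
broad = in_range(filler, "MONDO:0000001")

# With the range narrowed to 'infectious disease', it no longer does.
narrow = in_range(filler, "MONDO:0005550")
```

Under that assumption, `broad` is true and `narrow` is false, which is the effect the narrower range is meant to achieve.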

nicolevasilevsky commented 3 years ago

Are there other cases like this, @matentzn?

matentzn commented 3 years ago

I think you are right! You should fix these patterns as you suggest! Other cases of overlapping patterns:

The full list of diseases with duplicate matches of this kind can be found here.
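For illustration, a minimal sketch of how such a list of duplicate matches could be derived, assuming each pattern's matches are available as a set of term IDs. The pattern names are the ones mentioned in this thread; the `EX:` term IDs and the `duplicate_matches` helper are made-up placeholders, not real Mondo data or tooling:

```python
from collections import defaultdict

def duplicate_matches(matches_by_pattern):
    """Map each term matched by more than one pattern to its sorted pattern names."""
    patterns_by_term = defaultdict(set)
    for pattern, terms in matches_by_pattern.items():
        for term in terms:
            patterns_by_term[term].add(pattern)
    # Keep only the terms that were matched by at least two patterns.
    return {
        term: sorted(patterns)
        for term, patterns in patterns_by_term.items()
        if len(patterns) > 1
    }

# Hypothetical match data; the EX: IDs are placeholders.
matches = {
    "inflammatory_disease_by_site": {"EX:0000001", "EX:0000002"},
    "specific_inflammatory_disease_by_site": {"EX:0000001"},
    "specific_infectious_disease_by_location": {"EX:0000001", "EX:0000003"},
}

dups = duplicate_matches(matches)
for term, patterns in sorted(dups.items()):
    print(term, "matches", ", ".join(patterns))
```

Only `EX:0000001` is reported here, because it is the only placeholder term that appears under more than one pattern.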

cmungall commented 3 years ago

Hmm, for the specific_X patterns, is there a way to say that the filler should be a proper subclass of disease?

nicolevasilevsky commented 3 years ago

I think you just have to say the variable is the more specific subclass, like for specific_infectious_disease_by_location

it should be:

classes:
  infectious disease: MONDO:0005550
  anatomical structure: UBERON:0000061

matentzn commented 3 years ago

@nicolevasilevsky's suggestion is the only way: you can't say "proper subclass", as in "any class that is a subclass of the filler, but not the filler itself".
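To make the distinction concrete, here is a small sketch contrasting a reflexive subclass check (the class itself counts) with a proper one (it does not). It assumes, based on this thread, that range checks behave like the reflexive version. Both functions and the one-edge hierarchy are hypothetical illustrations, not features of the pattern language:

```python
# Toy is-a hierarchy (child -> parent); simplified, for illustration only.
PARENT = {"MONDO:0005550": "MONDO:0000001"}  # infectious disease -> disease

def ancestors(term):
    """Return all strict ancestors of term in the toy hierarchy."""
    found = []
    while term in PARENT:
        term = PARENT[term]
        found.append(term)
    return found

def is_subclass_or_self(term, cls):
    """Reflexive check: cls itself is accepted."""
    return term == cls or cls in ancestors(term)

def is_proper_subclass(term, cls):
    """Proper check: cls itself is rejected."""
    return cls in ancestors(term)

DISEASE = "MONDO:0000001"
reflexive_accepts_self = is_subclass_or_self(DISEASE, DISEASE)
proper_accepts_self = is_proper_subclass(DISEASE, DISEASE)
proper_accepts_child = is_proper_subclass("MONDO:0005550", DISEASE)
```

In this sketch the reflexive check accepts 'disease' as a filler for a 'disease' range, while the proper check would reject it and still accept 'infectious disease'. Since the proper form cannot be expressed, narrowing the range is the available workaround.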

matentzn commented 3 years ago

@cmungall Is there any problem with Nicole's suggestion?

nicolevasilevsky commented 3 years ago

@matentzn suggested that @sabrinatoro read through and try to solve this ticket. This type of problem would be nice to mention in the paper.

sabrinatoro commented 2 years ago

Analysis is here: https://docs.google.com/spreadsheets/d/1stK1FPjNRgNjt0QJiqRKp7nMjvq08yuBfIoIy7uVvaI/edit?usp=sharing

I compared the pattern pairs from the list above. The analysis shows that: