obophenotype / human-phenotype-ontology

Ontology for the description of human clinical features
http://obophenotype.github.io/human-phenotype-ontology/
Other
293 stars 51 forks source link

Final decision on what to do with NCIt: EQ or xref ONLY #4323

Closed matentzn closed 5 years ago

matentzn commented 5 years ago

Which of these is the final executive HP decision

  1. We remove all NCIt references from EQs and push them into xrefs
  2. We replace all NCIt references in EQs by something else (what?)
  3. NCIt will be used to refer to physical entities (tumors etc) (they remain in the EQs)

I have attached a list of NCIt references that still occur in the EQs (207 affected HP classes). The procedure for the diligent curators is:

  1. Open hp-edit
  2. Search for NCIt term in above list. Click on it.
  3. Click on Usage tab in protege to find all HP classes referencing that NCIt class
  4. Delete EQ (or edit, if there is a replacement)
  5. Add xref to the NCIT term

But please, be very sure this is a good idea. I am against it; I would rather pretend NCIt is about something that it is not (physical entities, not diseases), but I understand your arguments not to make that assumption, so no need to re-iterate here. Just decide what to do!

@cmungall @pnrobinson @drseb @dosumis

dosumis commented 5 years ago

I strongly favour option 3. From what I understand of NCit this is a reasonable interpretation of what these terms refer to. If we dump in xrefs then pulling terms back out into EQ defs will be hard.

pnrobinson commented 5 years ago

Option 3 is actually 180 degrees wrong. The HPO by definition refers to a phenotypic feature (the physical entity, tumor) and not to the disease entity. In principle, the NCIT definition should be

Lung carcinoma
equivalent to
 {disease
    <keep all of the current NCIT logical statements, which are a work of art>
   characterized_by HPO:lung_carcinoma (meaning the physical tumor)
}

The HPO definition should be something like

Lung carcinoma
equivalent to
carcinoma (MPATH or NCIT)
and 
<Some differentia>

or (probably better)

Lung carcinoma
characteristic_feature_of NCIT:lung carcinoma
matentzn commented 5 years ago

Okay then. So we can detach the short term HP stuff a bit from the wider considerations: What about this:

  1. We remove all EQs and replace them by xrefs for now.
  2. The person that does this keeps a list of all classes that they have edited this way.
  3. We schedule a meeting in the next weeks to come up with an initial set of patterns for neoplasm terms.
  4. We turn the list collected in 2. into one or more DOSDP files, and instantiate against pattern (that way we did not lose anything).

What do you think?

dosumis commented 5 years ago

@matentzn - seems like a reasonable approach, pending agreement on logical definitions.

pnrobinson commented 5 years ago

@matentzn did we have a clear action item here? Downstream alg's can use xrefs by the way if they really want to, and I am not sure that this decision currently will impinge much on our software stack. Having these items as xrefs would in principle allow us to automagically generate definitions once we have settled the modeling.

matentzn commented 5 years ago

There is only one last optional action item remaining here:

Someone could open src/ontology/hp_ncit_eq.owl in protege and for every HP class in there, make sure that in hp-edit.owl, the appropriate xref is present. So if you see: HP:1000 equivalentTo: has part some X and inheres in some NCIT:0011, make sure that in hp-edit.owl, HP:1000 has an xref to NCIT:0011. That is all!

@nicolevasilevsky probably the best to do that?

nicolevasilevsky commented 5 years ago

@matentzn i am just doing this now - I looked at the first five terms in hp_ncit-eq.owl and I don't see the NCIt terms xref'd in hp-edit.owl, as you described above. Sounds like the should be though, yes?

Should I check all the terms in hp_ncit-eq.owl against hp-edit.owl, or does it already seem like something is wrong/missing?

matentzn commented 5 years ago

I would assume none of the terms in the ncit eq file currently have xrefs; so yes, if you want them they need to be added!

nicolevasilevsky commented 5 years ago

I talked to @matentzn and I am going to add NCIT xrefs to the HPO terms, as they currently do not exist.

nicolevasilevsky commented 5 years ago

this term: HP_0030741 'Mediastinal teratoma'. already has a NCIT xref to: NCIT:C6438 Mediastinal teratoma, which is more specific than the NCIT term used in the logical def in the hp_ncit_eq.owl file (which uses NCIT_C3403 teratoma.) Do you want to update the hp_ncit_eq.owl file, @matentzn?