Open matentzn opened 1 year ago
@matentzn , I may need a upheno.sssom.config.yml
file for the mapping, correct?
For which part exactly?
filtering prefixes. Do we need to do that? Similar to this.
Yes, better.. There is sooo much noise we don't care about in uPheno!
Where do I grab that from? Or does that need to be curated?
No great file atm, but you can search this file https://github.com/obophenotype/upheno-dev/blob/master/src/curation/upheno-config.yaml for prefix_iri
to get a good starting point!
Ok, the robot
code generates the upheno-minimal.owl
file but after running oaklib
, there aren't any mappings generated. So I used the entire file to generate the mappings and the results look promising. This takes around 22 mins.
for this not to produce false positives like
DDPHENO:0000004 decreased migratory slug size (DDPHENO) skos:exactMatch DDPHENO:0000379 decreased vegetative cell size (DDPHENO) semapv:LexicalMatching oaklib 0.9411764705882353 oio:hasDbXref oio:hasDbXref pmid:18366659
we need the phenotype ontologies to follow standards and not use pmids as class xrefs
this may take a while to percolate, so should we filter at the robot level or the oak level?
I opened a parallel PR by accident. Sorry Nico!
@cmungall i am happy to lead a push on cleaning out these xrefs, but first we need to answer this: https://github.com/information-artifact-ontology/ontology-metadata/issues/94