monarch-initiative / semsimian

Simple rust implementation of semantic similarity
BSD 3-Clause "New" or "Revised" License
8 stars 5 forks source link

Produce specific similarity artifacts for Exomizer using custom counts #125

Open caufieldjh opened 5 months ago

caufieldjh commented 5 months ago

Based off of #124

Each will need to incorporate the counts from HPOA (for HP) and Monarch's phenotype files (for MP and ZP) - or in the latter case from wherever Monarch is getting them.

caufieldjh commented 5 months ago

See the gene to phenotype tables here: https://data.monarchinitiative.org/latest/tsv/gene_associations/index.html

These appear to be identical to the owlsim tables already in use by Exomizer (see https://archive.monarchinitiative.org/latest/owlsim/data/Danio_rerio/Dr_gene_phenotype.txt)

justaddcoffee commented 5 months ago

To flesh this out a bit, the plan we discussed I think

caufieldjh commented 5 months ago

IMPC phenotypes may not already be included in Monarch G2P tables, but Exomizer does use them.

caufieldjh commented 4 months ago

So to get this done, we will need to:

caufieldjh commented 4 months ago

See also #47 - but for terms missing from IC map instead of closure map

caufieldjh commented 4 months ago

If the closure map and IC map do not contain each others' keys, raise an error. We will work under the assumption that the user will provide inputs which are in alignment and contain all necessary IDs