Open cbizon opened 2 years ago
At the moment, we have two parameters: conflate (for gene/protein conflation) and drug_chemical_conflate (for drug/chemical conflation). We probably want to roll these up into a single parameter (conflations) to which a list of conflations to apply could be provided.

An example of what is stored today (chemical_drug_db|RXCUI:1486320):
"[\"PUBCHEM.COMPOUND:16666\", \"RXCUI:393549\", \"RXCUI:1046245\", \"RXCUI:1046531\", \"RXCUI:1235314\", \"RXCUI:1245396\", \"RXCUI:1486320\", \"RXCUI:1795938\", \"RXCUI:1863585\", \"RXCUI:1994323\", \"RXCUI:1999886\", \"RXCUI:1999887\", \"RXCUI:1999888\", \"RXCUI:1999889\", \"RXCUI:2102940\", \"RXCUI:2105551\", \"RXCUI:2566881\", \"RXCUI:2568111\", \"UMLS:C2698944\"]\n"

Since we only have ~9k DrugChemical conflations and 8.9 million GeneProtein conflations, I don't think it would be unreasonable to put them all into a single place, although of course lookups might be slower.
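To make the "single place" idea concrete, here is a minimal sketch of one shared key space with a per-conflation prefix. It uses a plain dict as a stand-in for the real backend, which this issue does not describe. Only the chemical_drug_db prefix appears above; the gene_protein_db prefix, the helper names, and the "first match wins" rule are assumptions for illustration, and the stored list is abbreviated from the example entry.

```python
import json

# Prefix per conflation type. "chemical_drug_db" is taken from this issue;
# "gene_protein_db" is a hypothetical name used only for illustration.
PREFIXES = {
    "DrugChemical": "chemical_drug_db",
    "GeneProtein": "gene_protein_db",
}


def make_key(conflation_type, curie):
    """Build a store key such as 'chemical_drug_db|RXCUI:1486320'."""
    return f"{PREFIXES[conflation_type]}|{curie}"


def lookup(store, curie, conflation_types):
    """Return the conflated identifier list for curie, or None.

    Conflation types are tried in the order given; the first hit wins
    (an assumed policy, not something this issue specifies).
    """
    for conflation_type in conflation_types:
        raw = store.get(make_key(conflation_type, curie))
        if raw is not None:
            return json.loads(raw)
    return None


# A dict standing in for the shared store, holding an abbreviated copy
# of the example entry quoted in this issue.
store = {
    "chemical_drug_db|RXCUI:1486320": json.dumps(
        ["PUBCHEM.COMPOUND:16666", "RXCUI:393549", "RXCUI:1486320"]
    ),
}

result = lookup(store, "RXCUI:1486320", ["GeneProtein", "DrugChemical"])
```

Each extra conflation type costs one more key lookup per identifier in this layout, which is one way the "lookups might be slower" concern could show up.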
Currently we have gene/protein conflation.
The ability to handle other types of conflation is partially implemented. Specifically, the backend parts are probably reasonably close, but we don't have a very good parameter scheme for specifying different types of conflation independently.
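One possible shape for such a parameter scheme, sketched under assumptions: a single conflations list, with the existing conflate and drug_chemical_conflate flags folded into it for backward compatibility. The function name, the validation behaviour, and the use of GeneProtein and DrugChemical as the accepted values are illustrative, not the current API.

```python
# Conflation names accepted by this sketch (assumed values).
KNOWN_CONFLATIONS = ("GeneProtein", "DrugChemical")


def resolve_conflations(conflations=None, conflate=False,
                        drug_chemical_conflate=False):
    """Merge the list parameter and the two legacy flags into one list.

    Order is preserved, duplicates are dropped, and unknown names raise
    ValueError so that typos do not silently disable a conflation.
    """
    requested = list(conflations or [])
    if conflate:
        requested.append("GeneProtein")
    if drug_chemical_conflate:
        requested.append("DrugChemical")

    unknown = [name for name in requested if name not in KNOWN_CONFLATIONS]
    if unknown:
        raise ValueError(f"Unknown conflation(s): {unknown}")

    # dict.fromkeys keeps the first occurrence of each name, in order.
    return list(dict.fromkeys(requested))


legacy = resolve_conflations(conflate=True)
mixed = resolve_conflations(["DrugChemical"], conflate=True)
```

With this shape, each conflation type can be switched on independently, and adding a new type only means extending KNOWN_CONFLATIONS rather than adding another boolean flag.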