NCATSTranslator / Feedback

A repo for tracking gaps in Translator data and finding ways to fill them.

SemMedDB filtering based on counts on opposing predicates #392

Open andrewsu opened 1 year ago

andrewsu commented 1 year ago

In this thread, there was this idea/suggestion:

the question is whether we should have a filter in text-mined resources to remove X - TREATS - Y when there are many more PMIDs associated with X - CAUSES - Y. This intuition seems to hold for the halothane / malignant hyperthermia example: SemMedDB currently has 3 PMIDs for TREATS and 32 PMIDs for CAUSES. (link)

This ticket tracks the exploration/implementation of this idea.
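
As a starting point for discussion, here is a minimal sketch of what such a filter could look like. Everything about its shape is an assumption: the function name `filter_opposing_predicates`, the `(subject, predicate, object, pmid_count)` tuple layout, and especially the `min_ratio` threshold of 5 are hypothetical and are not taken from SemMedDB or any existing Translator code. Only the halothane counts (3 vs. 32 PMIDs) come from the quoted suggestion.

```python
def filter_opposing_predicates(edges, target="TREATS", opposing="CAUSES", min_ratio=5.0):
    """Drop `target` edges that are heavily outnumbered by `opposing` edges.

    edges: iterable of (subject, predicate, object, pmid_count) tuples.
           This layout is hypothetical, not the actual SemMedDB schema.
    min_ratio: assumed cutoff; a target edge is removed when the opposing
               edge has at least min_ratio times as many PMIDs.
    """
    # Sum PMID counts per (subject, predicate, object) triple.
    counts = {}
    for subj, pred, obj, n in edges:
        key = (subj, pred, obj)
        counts[key] = counts.get(key, 0) + n

    kept = []
    for (subj, pred, obj), n in counts.items():
        if pred == target:
            n_opposing = counts.get((subj, opposing, obj), 0)
            if n_opposing >= min_ratio * n:
                continue  # opposing evidence dominates; drop this edge
        kept.append((subj, pred, obj, n))
    return kept


# The halothane example from the quoted suggestion.
edges = [
    ("halothane", "TREATS", "malignant hyperthermia", 3),
    ("halothane", "CAUSES", "malignant hyperthermia", 32),
]
print(filter_opposing_predicates(edges))
```

With the assumed ratio of 5, the TREATS edge is dropped because 32 >= 5 * 3, and the CAUSES edge is kept. Choosing the cutoff (a ratio, an absolute difference, or something smarter) is exactly the part that would need exploration.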

sierra-moxon commented 3 months ago

@andrewsu - do you think this can be closed as completed?

andrewsu commented 3 months ago

We haven't implemented this in our version of SemMedDB, and I'm not aware that others have either. I think we should leave this one open as a solid idea that would still be worth pursuing.