Reconstruct hetionet - Githubissues

Then if the max, mean, or median score is above some threshold you say there exists an edge between X and Y?

Almost. You could argue that an edge only exists given a threshold, but right now each edges has a likelihood score of existing given the max value of each sentence score

What is your gold standard in this case?

The gold standard here is the edges that are already existing in hetionet.

How did you decide the threshold?

The threshold I chose for that figure was a bit arbitrary (0.5 since probabilities). Note that 0.5 is not an optimal cutoff for every application. there are more applications that prefer to select a higher threshold because they want less noise compared or vise versa. Ideally, when I load this into hetionet I'll be incorporating all scored edges and just give your threshold may vary disclaimer. That's the beauty of confidence scores

I guess the only downside is if there are conflicting evidences.

Very good point. It be interesting to see the edges that have conflicting evidence. I agree using the max loses out on interesting information as ^. Future todo is to get more creative on how to translate sentences into edges.

greenelab / snorkeling

Reconstruct hetionet #100