fraenkel-lab / OmicsIntegrator

This repository is the working directory for the Garnet-Forest bundle of python scripts for analyzing diverse forms of 'omic' data in a network context.
http://fraenkel.mit.edu/omicsintegrator
BSD 2-Clause "Simplified" License
31 stars 21 forks source link

NaN prizes and edge weights #26

Open agitter opened 7 years ago

agitter commented 7 years ago

forest.py should check for NaN prizes and edge weights. I accidentally ran it with a NaN prize. It propagates the data to msgsteiner, producing output files that are technically valid but don't make sense. In my case, they ignored the specified root node.

I'll make this fix myself when time permits.

agitter commented 7 years ago

And it should also check for infinite prizes, which cause similar problems.

agitter commented 7 years ago

Does anyone else have an opinion about the preferred forest.py behavior when we encounter NaN or infinite prizes? We could:

1) Add a new warning that states the number of invalid prizes and continue running with the valid prizes 2) Stop execution

I will implement 1. unless there are other opinions. It seems most consistent with the rest of our error handling.

AmandaKedaigle commented 7 years ago

Yeah, I think 1 is good. There could be some kind of threshold for what percentage of prizes must be valid for it to continue, the way we handle protein name mismatches, but I don't feel strongly either way.