Open aakrosh opened 4 years ago
Version 0.2.5 introduced two new parameters ('-subsample' and '-rep') that increase the stability of results by exploring consensus clustering properties to a greater extent.
The former parameter regulates the fraction of samples that are randomly removed from each factorization. While deciding which samples will be removed we explicitly make sure that all samples will be clustered at least once. The later parameter sets the number of times a subset of all runs (random 80% of runs) is used to create a consensus matrix.
The above error appears when there is a sample that was not clustered in any of the runs in a subset. This is very unlikely while using default sumo parameters, as factorization is run 60 times and only 5% of samples are removed from each run.
For now, I recommend using the higher number of repetitions or setting '-subsample' parameter to 0 (which prevents encountering this issue even if -n is very small), however, this issue will have to be addressed in the future.
sumo run
fails with the following message when a small number of repetitions (-n 2
in this case) is used.Is there a minimum value that should be specified for a successful run?