apcamargo closed this issue 4 years ago
Thanks for being helpful as always and reporting this.
This happened because most of the data values are zeros. Clust found that more than 98% of the values are below 30.0 and concluded that the data is in log-scale, treating the <2% of values above 30.0 as outliers. As a result, Clust sometimes calculates 2.0 to the power of the values, which throws this warning for the large values in the dataset.
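To illustrate the mechanism, here is a minimal sketch in Python with NumPy. It is not Clust's actual code: the function name `looks_log_scaled`, its parameters, and the toy data are hypothetical, and only the 98% and 30.0 figures come from the explanation above. It also assumes the warning in question is NumPy's overflow `RuntimeWarning`, which is what NumPy emits when 2.0 is raised to a power too large for a 64-bit float.

```python
import warnings

import numpy as np


def looks_log_scaled(values, threshold=30.0, fraction=0.98):
    """Hypothetical heuristic: guess that the data is log-scale when
    more than `fraction` of the values fall below `threshold`."""
    values = np.asarray(values, dtype=float)
    return np.mean(values < threshold) > fraction


# Toy zero-inflated counts: 990 zeros and 10 large raw counts.
counts = np.concatenate([np.zeros(990), np.full(10, 5000.0)])

# 99% of the values are below 30.0, so the heuristic misfires.
is_log = looks_log_scaled(counts)

# Undoing the presumed log2 transform overflows for the large counts.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    restored = np.power(2.0, counts)  # 2.0 ** 5000 exceeds the float64 range

overflowed = bool(np.isinf(restored).any())
warning_categories = [w.category for w in caught]

print(is_log, overflowed, warning_categories)
```

Zero-inflated count data passes a test like this even though it was never log-transformed, because the zeros alone push the fraction of small values past the cutoff.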
I implemented a quick fix for this problem and have now released a new version, v1.8.10, with this fix, in addition to your previous contribution of plot transparency, README edits, and some other minor fixes that I did.
Thanks again for your feedback!
Thanks!
In any case, I think it's best if I disable the automatic normalization, since the data is zero-inflated rather than in log-scale.
Hi Basel,
I'm trying to use Clust on a count matrix with 28582 rows and 84 columns (excluding row names and column names), and I'm getting some warnings during the pre-processing step. The results seem normal.
This is the first time I've gotten these warnings. They didn't show up in any of my previous analyses.
I'm using NumPy 1.15.4.
count_matrix.tsv.zip