Closed ZheFrench closed 1 year ago
Hi,
You are right, the counts are supposed to be integers. The fact that they are not, comes from a default parameter setting in SimBu: remove_bias_in_counts=TRUE
. This setting is used to remove an initial mRNA bias in the provided count matrices. If you do not want this and rather want to work with integer counts directly, just set this setting to FALSE
.
One could also think about setting this parameter to FALSE by default. I will think about it when working on the last few bug fixes.
I hope this answered your question.
Best, Alex
Yeah that's better :) I will give a try.
It's kind of normalization by the library size by cell type . But you also have a norm_counts
parameter that can be set, does it apply the same behavior ? That would be twice a normalization if both are set to TRUE, no ? I'm just curious.
Tks. Really nice idea this software by the way.
Good point! @alex-d13 we could even add a round
parameter, which, if TRUE, results in the application the round()
function on the counts. Of course, we should test that the final sequencing depths are not too different from the expected ones, and the NB properties we tested in the paper are still maintained.
Does the output SummarizedExperiment::assays(simulation$bulk)[["bulk_counts"]])` should be a simulated raw counts matrix ? It should be integer values, no ?
Also values seems to be extremely low, near zero every time. I was expecting something more "bulky", round high values. Could you comment ? Tks !
remove_bias_in_counts=TRUE
:
remove_bias_in_counts=FALSE