Closed ning-y closed 5 months ago
So I don't think this is an issue. They are saying that the column ordering of the bootstrap matrices is not deterministic. This doesn't affect downstream tools which look at marginal stats like variance, or even covariance, which don't care about bootstrap ordering.
I quantified bulk RNA-seq with kallisto quant with default arguments (therefore, no bootstraps) and ten threads.
I then imported the h5 files via tximport.
The results of tximport are saved to TSV. If I repeat this process again with no changes to get another TSV, I find that the two TSVs are different.
If I set kallisto to single-threaded execution, making no other changes, I get a deterministic result: the same TSV every run.
I report this issue here rather than with kallisto, because kallisto authors have already responded to threading and determinism here: https://github.com/pachterlab/kallisto/issues/236#issuecomment-565616059. They say that the multi-threaded kallisto output only looks different due to randomness in which threads finish first.