Closed OceaneCsn closed 3 years ago
Thanks for the bug report @OceaneCsn! I will look into this and try to figure out what's happening with the Poisson model + K-means when running parallel = TRUE
, I'll let you know once (if?) I figure out what is going on.
Hi @OceaneCsn, I've fixed the issue. I had forgotten to pass the seed argument to the parallel processes for the Poisson/K-means cases, and that has now been fixed in the latest version (1.15.6 -> currently available in the master branch of this GitHub repo, it will probably take at least a day or two to show up on Bioconductor devel).
Let me know if you notice any other bugs, I appreciate your feedback!
Thank you @andreamrau for fixing this so quickly! I'll let you known if I find anything else :)
First, thanks for the amazing package!
After a brief exchange with Andrea (at the Netbio talks), I experimented a bit more with seeds and reproducibility in coseq. Even when passing a seed argument to the latest version of the
coseq()
function, I could not get identical coseq runs. Turns out theparallel
option might get in the way of reproducibility, at least for some of the models implemented in coseq. Here is what I get :parallel = FALSE
, Gaussian mixtures, Poisson mixtures and k-means can lead to repeatable results.(I used the correlation between the cluster assignments from two runs with identical parameters, that should be 1 for identical random states, the expected behavior)
parallel = TRUE
, Gaussian mixtures can lead to repeatable results, but not Poisson mixtures nor k-means.Can you reproduce this behavior?
If so, I don't know if there's a fix, but I thought I should share this in case it had gone unnoticed. (Maybe looking into
biocParallel
options for multithreading procedures and seed setting could be helpful).Bests regards
Session info :