mlr-org / parallelMap

R package to interface some popular parallelization backends with a unified interface
https://parallelmap.mlr-org.com
Other
57 stars 14 forks source link

proper seeding #43

Closed jakob-r closed 4 years ago

jakob-r commented 8 years ago

Bernd wanted to open an issue as mentioned here. We might want to improve reproducibility in parallelMap by using seeding-mechanics for the different parallization techniques.

pat-s commented 5 years ago

Should we finally close this issue by incorporating the approach of https://stackoverflow.com/a/51347058/4185785?

I am afraid that many people messed up reproducibility in the past just because they were not aware of this. Also I think many people are not aware that set.seed() does not work for parallel calls.

Related issues

Suggested solution

parallelStart() would also gain an arg to turn this behavior off. By default it would ensure reproducibility in parallel scenarios without requiring knowledge by the user about different RNG kinds.

@berndbischl @jakob-r


On a side note: How does future deal with that issue when using plan(multicore) or plan(multisession)? @mllg (better discuss this one in a separate issue in mlr3)