dieterich-lab / rp-bp

Rp-Bp is a Bayesian approach to predict, at base-pair resolution, ribosome occupancy and translation.
MIT License
7 stars 5 forks source link

Improve parallelisation #130

Closed eboileau closed 1 year ago

eboileau commented 2 years ago

As part of the project, we discussed how to improve parallelisation, in particular

In rp-bp (pbiotools), we use joblib for simple parallelism in many places, and most problems are embarrassingly parallel, except maybe this one. Internally, we call joblib.Parallel(...). All wrappers are under pbiotools/misc/parallel.py. There is a draft module pbiotools/misc/dask_utils.py, but it is not currently used (and may not be up-to-date).

In addition, many scripts have the option to be submitted (possibly with dependencies) to SLURM, using slurm.check_sbatch(...). Options handling/logging, etc. is handled in pbiotools/misc/slurm.py.

There are obviously many questions (at least I have):

eboileau commented 1 year ago

Addressing these issues would require significant changes, and so far we don't seem to have any particular problem with the CmdStanPy interface. Setting as low priority.