dib-lab / farm-notes

notes on the farm cluster

some snakemake principles - #45

Open ctb opened 2 years ago

ctb commented 2 years ago

Mostafa Shokrof

Add a resources tag to each job with the expected memory, CPU, and time. It makes things easier when running on a cluster.
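
A rough sketch of why this helps: once a job states its memory, CPUs, and time, those numbers can be turned mechanically into scheduler requests. Everything named here is an assumption for illustration and not something stated in this thread: the rule name `map_reads`, the resource names (`threads`, `mem_mb`, and `runtime` in minutes), and the use of Slurm's `sbatch` as the scheduler.

```python
# Hypothetical per-rule resources, mirroring what a Snakefile rule might declare:
#
#   rule map_reads:
#       threads: 4
#       resources:
#           mem_mb=8000,
#           runtime=120
#
RESOURCES = {
    "map_reads": {"threads": 4, "mem_mb": 8000, "runtime": 120},
}


def sbatch_args(rule_name, resources=RESOURCES):
    """Translate a rule's declared resources into sbatch options (assumes Slurm)."""
    res = resources[rule_name]
    return [
        f"--cpus-per-task={res['threads']}",
        f"--mem={res['mem_mb']}",    # sbatch reads a bare number as megabytes
        f"--time={res['runtime']}",  # sbatch reads a bare number as minutes
    ]


print(" ".join(sbatch_args("map_reads")))
```

Without declared resources, every job would have to be submitted with the same worst-case request, or the request would have to be guessed by hand for each rule.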

Mostafa Shokrof

Also, separate the input file paths from the script so it will be easier to run on different datasets.
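
One way to picture this, as a minimal sketch: the dataset description lives in a config, and the workflow logic only ever reads paths from it. In Snakemake this is typically done with a config file (`configfile:` in the Snakefile or `--configfile` on the command line); the plain-Python version below uses JSON, and the sample names, paths, and the `.summary.txt` suffix are all hypothetical.

```python
import json

# Hypothetical config content; in practice this would live in its own file
# (for example one config file per dataset), not inside the workflow script.
CONFIG_TEXT = """
{
  "samples": {
    "sampleA": "data/sampleA.fq.gz",
    "sampleB": "data/sampleB.fq.gz"
  },
  "outdir": "results"
}
"""


def load_config(text):
    """Parse the dataset description; the workflow logic never hard-codes paths."""
    return json.loads(text)


def output_targets(config):
    """Build one output path per sample using only what the config provides."""
    return [
        f"{config['outdir']}/{name}.summary.txt"
        for name in sorted(config["samples"])
    ]


config = load_config(CONFIG_TEXT)
print(output_targets(config))
```

Switching to a different dataset then means pointing at a different config, with no edits to the workflow itself.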

SichongP commented 2 years ago

if running with --use-conda, don’t run snakemake/genome-grist/charcoal/sgc in a new directory every time; snakemake puts the conda environment stuff under .snakemake/conda in the current working directory, and this will be recreated every time you start in a new directory.

Also, --conda-prefix lets you set a shared conda directory for all snakemake projects. Any rules that use the same env.yaml file can then reuse the prebuilt conda envs.
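
For anyone launching snakemake from a wrapper script, a small sketch of what that invocation might look like. The shared directory path and the default core count are hypothetical; `--use-conda`, `--conda-prefix`, and `--cores` are the snakemake command-line options.

```python
import shlex


def snakemake_command(conda_prefix, cores=4):
    """Build a snakemake invocation that keeps conda envs in one shared directory."""
    return [
        "snakemake",
        "--cores", str(cores),
        "--use-conda",
        "--conda-prefix", conda_prefix,
    ]


# Hypothetical shared location; any directory you can write to would do.
SHARED_ENVS = "/home/username/snakemake-conda-envs"

print(shlex.join(snakemake_command(SHARED_ENVS)))
```

Using the same prefix from every project directory is what allows the environments to be found and reused.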

if you didn’t create the file, snakemake did; you can safely remove it and snakemake will recreate it!

Not the .snakemake directory. It's where all run-related metadata are kept :)

ctb commented 2 years ago

if you didn’t create the file, snakemake did; you can safely remove it and snakemake will recreate it!

Not the .snakemake directory. It's where all run-related metadata are kept :)

(well, you can remove it but snakemake might rerun stuff that you'd prefer not to be rerun ;)