wodanaz / Assembling_viruses


Adds sbatch scripts for Escape_Variants.md steps 3-5 #4

Closed johnbradley closed 3 years ago

johnbradley commented 3 years ago

Adds changes to run steps 3-5 from Escape_Variants.md.

Adds an sbatch-array.sh script to simplify escape-variants-pipeline.sh. This script runs an sbatch array job suited to a file containing a list of filenames to process.
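To illustrate the array-job idea, here is a minimal sketch. It is not the actual contents of sbatch-array.sh. The function names build_array_cmd and pick_filename are hypothetical, as is the choice of one array task per line of the list file; --array and SLURM_ARRAY_TASK_ID are standard Slurm features.

```shell
#!/usr/bin/env bash
# Hypothetical sketch only; the real sbatch-array.sh may work differently.

# Print the sbatch command that would launch one array task per line
# of the list file (assumption: one filename per line).
build_array_cmd() {
    local job_script="$1" list_file="$2"
    local count
    count=$(grep -c '' "$list_file")   # counts lines, even an unterminated last one
    if [ "$count" -lt 1 ]; then
        echo "no filenames in $list_file" >&2
        return 1
    fi
    echo "sbatch --array=1-${count} ${job_script} ${list_file}"
}

# Print the filename on line <task_id> of the list file.
# Inside a job script, task_id would be $SLURM_ARRAY_TASK_ID.
pick_filename() {
    local list_file="$1" task_id="$2"
    sed -n "${task_id}p" "$list_file"
}
```

Inside the job script, each task would then look up its own input with something like pick_filename "$1" "$SLURM_ARRAY_TASK_ID".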


For every module load command that did not specify a version, I added the current default version explicitly. This improves reproducibility.


The "Create BAM from SAM and make an index" step was using the [-N/--nodes] argument to specify one node:

#SBATCH -N 1

I removed that line because I didn't think it was having any effect.


The "Create a dictionary file for using picard tools" step creates an MT246667.dict file. This step cannot be run twice without first deleting that file, so subsequent runs of the pipeline will fail unless MT246667.dict is removed. This problem will be fixed as part of issue #3.
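Until issue #3 is resolved, one possible workaround is to delete the stale dictionary before rerunning the pipeline. The sketch below is an assumption about how that could look, not necessarily the fix chosen in issue #3; the function name remove_stale_dict is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical workaround sketch; not the fix from issue #3.

# Delete an existing dictionary file so the picard step can run again.
# Does nothing (and succeeds) when the file is absent.
remove_stale_dict() {
    local dict_file="$1"
    if [ -e "$dict_file" ]; then
        rm -- "$dict_file"
        echo "removed stale $dict_file"
    fi
}

# Example call before rerunning the pipeline:
#   remove_stale_dict MT246667.dict
```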

johnbradley commented 3 years ago

As of now the pipeline looks like this:


./run-escape-variants.sh
       `-> scripts/escape-variants-pipeline.sh
                    `-> scripts/index-reference-genome.sh
                    `-> scripts/remove-nextera-adapters.sh
                    `-> scripts/map-bwa-cleaned-libs.sh
                    `-> scripts/create-bam-from-sam.sh
                    `-> scripts/create-picard-dictionary.sh
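The call tree above could be driven by a loop along these lines. This is a sketch under stated assumptions, not the real escape-variants-pipeline.sh: the run_steps function and the STEPS variable are hypothetical, and the way each step is launched (for example via sbatch --wait) is an assumption rather than something confirmed by this PR.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; the real escape-variants-pipeline.sh may differ.

# Step scripts in the order shown in the tree above.
STEPS="index-reference-genome remove-nextera-adapters map-bwa-cleaned-libs create-bam-from-sam create-picard-dictionary"

# run_steps <runner> [runner args...]
# Invokes the runner once per step script, in order, and stops at the
# first step that fails.
run_steps() {
    local step
    for step in $STEPS; do
        "$@" "scripts/${step}.sh" || {
            echo "step failed: ${step}" >&2
            return 1
        }
    done
}

# Dry run that only prints the script paths:
#   run_steps echo
# On the cluster (assumption): wait for each job before starting the next:
#   run_steps sbatch --wait
```

Passing the runner as an argument keeps the ordering logic testable without a Slurm installation.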