somatic mutation spike in pipeline without docker

bioinform / somaticseq

An ensemble approach to accurately detect somatic mutations using SomaticSeq

http://bioinform.github.io/somaticseq/

BSD 2-Clause "Simplified" License

194 stars 53 forks source link

somatic mutation spike in pipeline without docker #70

Closed gianfilippo closed 5 years ago

gianfilippo commented 5 years ago

Hi,

is there a way to run the "in silico somatic mutation spike in pipeline" without docker ?

Do you have a version for singularity ?

Thanks

litaifang commented 5 years ago

The mutation spike in pipeline is not exactly a part of somaticseq. The docker scripts just run bamsurgeon (https://github.com/adamewing/bamsurgeon). So you can refer to bamsurgeon's repo to learn how to install and run their software. I do have singularity scripts for them, but it's not as well supported but we don't use it much internally so haven't debugged/fixed them routinely: https://github.com/bioinform/somaticseq/tree/master/utilities/singularities/bamSimulator

gianfilippo commented 5 years ago

Hi, thanks for the prompt reply!

I will take a look at both. Thanks

gianfilippo commented 5 years ago

Hi,

I am sorry, but I have to reopen this as I have more questions.

I am trying the singularity based scripts. In particular, the BamSimulator_multiThreads.sh.

It generates 20 numbered dir and a log dir under the OUTPUT directory.

As far as I understand, I have to run the scripts under each numbered dir and the one under the log dir at the end. Correct ?

Is each script running on multiple cores ? can I submit each script as Slurm job ?

Thanks

litaifang commented 5 years ago

Correct. Each script under the 20 numbered directories are designed as single-core jobs. I've designed to have those scripts submitted as sun grid engine jobs (e.g., the headers requesting memory, cores, and log file locations, etc. used SGE format), but you can try to submit those scripts as Slurm jobs.

litaifang commented 5 years ago

And then the final script under the OUTPUT/log has the command to merge all the bam and VCF files in each of the 20 sub-directories.... but only after all 20 jobs are complete.

gianfilippo commented 5 years ago

Great!

Thanks for your help!!