ncsa / MayomicsVC

Variant Calling Pipeline in Cromwell/WDL
MIT License
21 stars, 5 forks

Increasing number of cores for running GATK4 tools. #107

Open ambarishK opened 5 years ago

ambarishK commented 5 years ago

Hi! I am interested in, and currently working on, improving the scalability of GATK4 tools. There are explicit parameters for increasing the number of nodes on a multi-node cluster, but I am currently using a single-node Spark cluster and want to check GATK4 performance on a multi-core machine. Which parameter should I set to increase the number of cores allotted?

Waiting for your reply.

azzaea commented 5 years ago

Hi! Your work looks interesting. Our GATK4 pipeline (in the `dev-gatk` branch) uses the non-Spark invocations of GATK4 tools. All these tools run single-threaded, with the exception of the `HaplotypeCaller`. We provide access to the threads of the `HaplotypeCaller` via the `HaplotyperThreads` parameter. Is this what you are looking for?
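For anyone reading along, here is a minimal sketch of how a thread count could reach a non-Spark `HaplotypeCaller` call. It assumes that `HaplotyperThreads` is forwarded to GATK4's `--native-pair-hmm-threads` argument, which this thread does not confirm; the function name and the file names are hypothetical and only serve the illustration.

```python
from typing import List


def haplotype_caller_cmd(reference: str, bam: str, out_vcf: str, threads: int) -> List[str]:
    """Build an argument list for a non-Spark GATK4 HaplotypeCaller run.

    Assumption: the pipeline's HaplotyperThreads value ends up in
    --native-pair-hmm-threads (the PairHMM thread count).
    """
    if threads < 1:
        raise ValueError("threads must be >= 1")
    return [
        "gatk", "HaplotypeCaller",
        "-R", reference,   # reference FASTA
        "-I", bam,         # input alignments
        "-O", out_vcf,     # output variant calls
        "--native-pair-hmm-threads", str(threads),
    ]


if __name__ == "__main__":
    # Hypothetical file names; print the command instead of running it.
    print(" ".join(haplotype_caller_cmd("ref.fa", "sample.bam", "sample.vcf.gz", 8)))
```

The list can be passed to `subprocess.run(cmd, check=True)` once the paths point at real files. Note that this only parallelises the PairHMM step, so the rest of the tool still runs single-threaded.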

ambarishK commented 5 years ago

Yes. This will help me with the non-Spark GATK tools, especially HaplotypeCaller. I have also found the necessary parameters for the Spark-based tools. Thank you so much.
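For reference, a rough sketch of the kind of Spark-side setting involved on a single machine: the core count goes into the Spark master string (`local[N]`), which is passed after the `--` separator on the `gatk` command line. The helper name, the choice of `MarkDuplicatesSpark`, and the file names are assumptions made for illustration, not something taken from the pipeline.

```python
from typing import List


def spark_local_cmd(tool: str, bam: str, out: str, cores: int) -> List[str]:
    """Build an argument list for a GATK4 Spark tool run on one machine.

    Spark arguments follow the "--" separator; local[N] asks Spark to use
    N worker threads on the local machine.
    """
    if cores < 1:
        raise ValueError("cores must be >= 1")
    return [
        "gatk", tool,
        "-I", bam,
        "-O", out,
        "--",
        "--spark-runner", "LOCAL",
        "--spark-master", f"local[{cores}]",
    ]


if __name__ == "__main__":
    # Hypothetical tool choice and file names; print the command instead of running it.
    print(" ".join(spark_local_cmd("MarkDuplicatesSpark", "sample.bam", "marked.bam", 16)))
```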

Could I get your email address for further correspondence?

azzaea commented 5 years ago

I think you might find this paper handy too; but have it your way.

One advantage of GitHub issues is that your post can be seen by more than one person, so you are likely to get a quicker response. If it is more convenient, though, this is my email: azzaea(at)gmail.com.