broadinstitute / gatk-protected

Obsolete/Legacy GATK repository -- go to https://github.com/broadinstitute/gatk instead
BSD 3-Clause "New" or "Revised" License
33 stars 20 forks source link

CNV ConvertACNVResults: tag doc, give example command, question summary #1080

Closed sooheelee closed 7 years ago

sooheelee commented 7 years ago

Pending questions

ConvertACNVResults original summary blurb:

Convert files into TITAN and Broad CGA Allelic CapSeg (ACS) formats.  This tool uses spark, though running locally is fine.
As part of this process, this tool generates calls whether a particular segment is balanced (MAF=0.5)
IMPORTANT:  The additional CNLoH calls from this tool should be treated with a lot of skepticism.  Preliminary results indicated very poor performance.
LeeTL1220 commented 7 years ago

@sooheelee Answers to questions...

sooheelee commented 7 years ago

Made some edits based on your answers @LeeTL1220. One unanswered question:

"This tool uses spark, though running locally is fine." Is using Spark the default? What parameters are involved to use or not use Spark (e.g. --sparkMaster)? Can I instead say "To enable Spark, use the --sparkMaster option."?

LeeTL1220 commented 7 years ago

You cannot disable spark in this tool. The default value for sparkMaster parameter is "local[*]", which runs the tool locally (using all available cores), rather than on a spark cluster.

sooheelee commented 7 years ago

Hmm, wouldn't it be better to be able to disable Spark altogether, e.g. with --disableSpark, like our other tools do? I think I read somewhere using Spark locally adds inefficient overhead to the compute. @LeeTL1220

I do see the appeal of having only one Spark parameter though.

sooheelee commented 7 years ago

Made changes to rename example command files and also to clarify the tool is non-Spark by default but can use Spark. Once the checks complete, I will just merge this.

LeeTL1220 commented 7 years ago

Sounds good.

On Fri, May 26, 2017 at 11:11 AM, sooheelee notifications@github.com wrote:

Made changes to rename example command files and also to clarify the tool is non-Spark by default but can use Spark. Once the checks complete, I will just merge this.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/broadinstitute/gatk-protected/pull/1080#issuecomment-304308506, or mute the thread https://github.com/notifications/unsubscribe-auth/ACDXk4rHJfpJ4nAgdidu1OxwuHACfrOnks5r9uuggaJpZM4Nh0NO .

-- Lee Lichtenstein Broad Institute 75 Ames Street, Room 8011A Cambridge, MA 02142 617 714 8632