GELOG / adam-ibs

Ports the IBS/MDS/IBD functionality of Plink to Spark / ADAM
Apache License 2.0
3 stars 6 forks source link

IBS clustering constraint: maximum cluster size (--mc, --mcc) #8

Open davidonlaptop opened 9 years ago

davidonlaptop commented 9 years ago

Description

Similar to plink --cluster with --mc or --mcc options. See the wiki on IBS-MDS Process and the diagram for the Genome file.

More information can be found on the --cluster and --genome-full options in the section on Pairwise IBD estimation of plink manual.

The input file is the model created in #3.

This feature adds a constraint on the --cluster option described in issue #7.

Analysis

Add a comment to this issue with:

Add a comment to this issue describing how this will be implemented in Spark, and how it differs from plink.

Implementation

The implementation should use: