Open davidonlaptop opened 9 years ago
Similar to plink --cluster option. See the wiki on IBS-MDS Process and the diagram for the Genome file.
plink --cluster
More information can be found on the --cluster and --genome-full options in the section on Pairwise IBD estimation of plink manual.
--cluster
--genome-full
The input file is the model created in #3.
Add a comment to this issue with:
Add a comment to this issue describing how this will be implemented in Spark, and how it differs from plink.
Also update the class diagram on the wiki page describing PLink formats (when incomplete) and add a class diagram describing the models implemented in Scala for this feature on the wiki page on the MGL804 formats.
The implementation should use:
Important note: The model can be only in memory for now, but you'll need to integrate into the ADAM format later on. You'll probably need to create a new record type.
Description
Similar to
plink --cluster
option. See the wiki on IBS-MDS Process and the diagram for the Genome file.More information can be found on the
--cluster
and--genome-full
options in the section on Pairwise IBD estimation of plink manual.The input file is the model created in #3.
Analysis
Add a comment to this issue with:
Design
Add a comment to this issue describing how this will be implemented in Spark, and how it differs from plink.
Also update the class diagram on the wiki page describing PLink formats (when incomplete) and add a class diagram describing the models implemented in Scala for this feature on the wiki page on the MGL804 formats.
Implementation
The implementation should use:
Important note: The model can be only in memory for now, but you'll need to integrate into the ADAM format later on. You'll probably need to create a new record type.