Closed davidonlaptop closed 9 years ago
Informations pouvant être utiles
Visualisation de contenu dans le fichier .bed : $ xxd -b test.bed
Fonctions : calc_genome_emitn
Bonjour, pourriez-vous faire un statut sur ce point ?
Voici le lien vers la page du Wiki décrivant les champs du fichier .genome: https://github.com/GELOG/adam-ibs/wiki/Algorithm-for-Pairwise--IBS-IBD-computation-(--genome)
Description
Similar to
plink --genome
option. See the wiki on IBS-MDS Process and the diagram for the Genome file.The input files are those created in #2.
The fields required for --cluster and --mds-plot are:
There is more fields, but they are will be done in Part II.
Analysis
Add a comment to this issue with:
Design
Add a comment to this issue describing how this will be implemented in Spark, and how it differs from plink.
Also update the class diagram on the wiki page describing PLink formats (when incomplete) and add a class diagram describing the models implemented in Scala for this feature on the wiki page on the MGL804 formats.
Implementation
The implementation should use:
Important note: The model can be only in memory for now, but you'll need to integrate into the ADAM format later on. You'll probably need to create a new record type.