Adds a helper script for generating family data from sample-metadata:
Pedigree
Pedigree in PLINK format
sample -> participant reverse lookup
Checklist
[ ] Related Issue created
[x] Tests covering new change
[x] Linting checks pass
Misc
An earlier version of this script (#1) made some incorrect assumptions about the relationship between participants and samples. This version allows for a 1 -> n relationship between participants and samples, and all possible variations will be present in the final PED file.
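To illustrate the 1 -> n handling, here is a minimal sketch of one possible reading of "all possible variations": every combination of a participant's samples with each parent's samples. The function name `expand_trio`, its arguments, and the use of `"0"` for a missing parent are assumptions made for this example and are not taken from the script itself.

```python
from itertools import product


def expand_trio(child_samples, father_samples, mother_samples):
    """Return every (child, father, mother) sample-ID combination.

    Hypothetical helper: an empty parent list is treated as a missing
    parent and encoded as "0", the usual pedigree placeholder.
    """
    fathers = father_samples or ["0"]
    mothers = mother_samples or ["0"]
    return list(product(child_samples, fathers, mothers))


# A child with two samples, a father with one sample, and no mother on record
rows = expand_trio(["S1", "S2"], ["S3"], [])
```

With this reading, a participant with two samples produces two pedigree rows, one per sample.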
Using the de novo functionality in Hail requires a pedigree in PLINK format, so that has been added as an output option. I've confirmed that Peddy accepts PLINK .fam and .ped files interchangeably, so .fam is the new default for this application.
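For reference, a PLINK .fam file has six whitespace-delimited columns and no header: family ID, individual ID, father ID, mother ID, sex (1 = male, 2 = female, 0 = unknown) and phenotype (1 = control, 2 = case, -9 or 0 = missing). The sketch below formats a single row; `fam_line` and its defaults are hypothetical names chosen for illustration and do not reflect how the script writes its output.

```python
def fam_line(family_id, sample_id, father_id="0", mother_id="0", sex=0, phenotype=-9):
    """Format one tab-delimited PLINK .fam row (hypothetical helper).

    Defaults encode "unknown": "0" for a missing parent, 0 for unknown
    sex, and -9 for a missing phenotype.
    """
    fields = (family_id, sample_id, father_id, mother_id, sex, phenotype)
    return "\t".join(str(field) for field in fields)


# An affected male sample with a known father and no mother on record
line = fam_line("FAM1", "S1", father_id="S3", sex=1, phenotype=2)
```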
The mapping back from samples to participants is required when formatting results for presentation back to clinical analysts, so creation of that mapping has been included as a step here.
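The reverse lookup can be sketched as inverting a participant -> samples mapping. This is an illustrative example only: `reverse_lookup` and the shape of its input are assumptions, and raising an error when a sample is claimed by two participants is a design choice made here, not confirmed behaviour of the script.

```python
def reverse_lookup(participant_to_samples):
    """Invert {participant: [samples]} into {sample: participant}.

    Hypothetical helper: raises ValueError if one sample ID is listed
    under two different participants, since the lookup would be ambiguous.
    """
    sample_to_participant = {}
    for participant, samples in participant_to_samples.items():
        for sample in samples:
            previous = sample_to_participant.get(sample)
            if previous is not None and previous != participant:
                raise ValueError(
                    f"sample {sample} maps to both {previous} and {participant}"
                )
            sample_to_participant[sample] = participant
    return sample_to_participant


lookup = reverse_lookup({"P1": ["S1", "S2"], "P2": ["S3"]})
```

Results keyed by sample ID can then be relabelled with `lookup[sample_id]` before being presented to analysts.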