Closed lemieuxl closed 8 years ago
This can be done between the prunning and the extraction of the pruned markers.
We can only do a set comparison between the pruned markers (in the *.prune.in
file) and the autosomal markers (from the bim
file).
It looks like the --indep-pairwise
already select autosomal markers (it process only autosomal markers). As described in Plink's manual:
[...] it is probably best to apply this analysis to a subset that are pruned to be in approximate linkage equilibrium, say on the order of 50,000 autosomal SNPs. Use the --indep-pairwise and --indep commands to achieve this, described here.
After more scrutiny, it appears that the sexual chromosomes are indeed pruned (unlike what is written in Plink's documentation). We need to address this.
When creating the genome file, only the autosomal markers should be present in the dataset (for computing IBS and MDS file.
This should only affect the
find_related_samples
script sincethe same input data will be used for creating the MDS file in the
check_ethnicity` script.