Open batelz opened 8 months ago
Hi @batelz.
From the look of your .ped files, it looks like indeed the first one is mostly empty (majority of genotypes missing as 0 0), making it so that no SNP seems to have data in both samples. Perhaps something went bad when converting from EIG to PLINK?
Hi,
When running the plink2tkrelated pipeline (which is VERY easy to use -- thanks for that!), The analysis is skipped because it can't find overlapping SNPs.
The input: I prepared two separate .ped files, each corresponding to an individual, from a genotype file that I converted from EIG format to PLINK format. Given that I'm working with the 1240K genotype, and these samples are relatively high coverage (approximately x0.6 and x9), it's highly unlikely that the two samples do not share any SNPs.
I am using the 1240K allele frequencies you provided - just for testing.
My .ped files look like this: (I'm showing two samples as an example ) SN1:
15 SN1 0 0 1 -9 0 0 G G 0 0 0 0 0 0 0 0 A A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 A A .....
SN2:1 SN2 0 0 1 -9 A A A A 0 0 T T G G C C A A 0 0 C C T T C C G G T T C C .....
and maps: SN1:SN2:
Output:
Any ideas what could have gone wrong? Thanks.