Closed nokimchen closed 3 years ago
Hi @nokimchen, It's probably like you say: not all users have information about the same genotypes. If you look at https://opensnp.org/genotypes, where the data comes from, you'll see that the data comes from different platforms (e.g. 23andme, ancestry), and therefore likely from different genotype chips. This means that we have no information for individuals for which the SNP was not genotyped and not imputed.
More generally, if you want to have more accurate information about allele frequency, I'd recommend using something like gnomAD https://gnomad.broadinstitute.org/variant/5-111182198-G-A?dataset=gnomad_r2_1, which summarizes data from thousands of individuals, and groups them by population.
Hi all, this question is not really about the package... i am new to SNP studies. So, kindly pardon me for my newbie questions..
I would like to identify the frequency of genotypes of vairous SNPs in a gene For example, when i try to get the genotype of 10 users, i get the genotypes for all the 10 users
but when i try to get the genotype of the same 10 users, but different SNP rs id, i get genotype of only 7 users
genotypes('rs11952607', userid='1,6,8,10,11,13,14,16,17,19', df=TRUE)
My question is that, why am i not getting the genotype info of the remaining 3 users for rs11952607? is it because rs11952607 SNP was not detected in their genome? so, it is safe to say that for rs11952607 there is GG genotype for 5 individuals and AG for 2 individuals and the remaining 3 individuals has no SNP in rs11952607 ? and also rs664517 is more common (found in all 10 individuals) while rs11952607 is less common (found only in 7 individuals) ? based on these 10 users
NOTE: xxxxxxxxxx represent the usernames