thomasvangurp / epiGBS

Code for working with epiGBS data
MIT License
10 stars 7 forks source link

Why is the sum of snps and methylated positions higher than the position in the merged vcf file? #21

Open FleurGaBru opened 6 years ago

FleurGaBru commented 6 years ago

zcat merged.tsv.gz | grep -v "^#" | wc -l 10261384 zcat snp.vcf.gz | grep -v "^#" | wc -l 494150 cat methylation.bed | grep -v "chr" | wc -l 9827493 snps + methylations sites = 10321643

Hence, there must be overlap between SNPs and methylated sites. Is that correct?