andrewparkermorgan / argyle

An R package for import, QC and analysis of Illumina Infinium genotyping arrays
31 stars 10 forks source link

REF and ALT alleles shouldn't match, but they do at these marker #9

Closed Gabriel-Kirchgraber closed 4 years ago

Gabriel-Kirchgraber commented 4 years ago

Hi, I had the same problem as a previous poster (#5) with the SNP_Map file provided by Neogene, it had the TOP/BOTTOM customer Strand etc. encoding for reference and alt alleles. I followed your advice and constructed them myself from my samples (31), but for some SNPs there is no alternative allele, it's the same in all samples, so I get the above warning. Does this pose a problem for further analysis? For example if I start comparing to the consensus calls of other strains/substrains where an alternative allele exists. Maybe I should construct new ref/alt allele map with the consensus calls? I am doing the new miniMuga v2 assay and have consensus calls for ca. 300 strains/substrains. Best, Gabriel.

andrewparkermorgan commented 4 years ago

Gabriel,

The best resource for creating a market map for any of the MUGA platforms is probably this repository from Karl Broman: https://github.com/kbroman/MUGAarrays

He has created files with chromosome, physical position, genetic position, ref and alt alleles (and strand with respect to reference genome) for MiniMUGA. You should be able to use that to make the maker map required by argyle.

I find the reporting of strandedness from Neogen to be quite confusing.

APM

On Jul 17, 2020, at 7:05 AM, Gabriel-Kirchgraber notifications@github.com wrote:

Hi, I had the same problem as a previous poster with the SNP_Map file provided by Neogene, it had the TOP/BOTTOM customer Strand etc. encoding for reference and alt alleles. I followed your advice and constructed them myself from my samples (31), but for some SNPs there is no alternative allele, it's the same in all samples, so I get the above warning. Does this pose a problem for further analysis? For example if I start comparing to the consensus calls of other strains/substrains where an alternative allele exists. Maybe I should construct new ref/alt allele map with the consensus calls? I am doing the new miniMuga v2 assay and have consensus calls for ca. 300 strains/substrains. Best, Gabriel.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or unsubscribe.

Gabriel-Kirchgraber commented 4 years ago

Thanks! I will check it out! I also kind of despaired trying to figure out how the Neogene strand designations worked and was quite relieved to discover that even the author of the tool didn't get it ;-).