Open mdavy86 opened 9 years ago
To enable biologists and lab staff to contribute to qc efforts I've put together questions of interest to be asked from a GBS run. Technical details are left out to draw non-bifos in.
Thats good, many of the questions cover more detail than in the last meeting minutes.
We have some code investigating post aligning QC, fragment distributions, modeled as an exponential decay (where applicable), size selection bias relative to the population of known tag sites, depth distribution, reml mixed model analysis of 96 technical samples for 6 genotypes.
Link to fastq_screen, that utility that subsample reads in fastq files to check for contaminations against a configurable set of Bowtie2 genome indexes: http://www.bioinformatics.babraham.ac.uk/projects/fastq_screen/
This is a placeholder to discuss what we are doing in terms of Quality control of BBS data
Plant and Food Research
We have some
perl
scripts,knitr
Rmarkdown scripts, and ashiny
application looking at quality control aspects of GBS restriction sites forbam
alignments.The
shiny
application does some exploratory analysis summarizing 96 wells * 2 bam files for ~1.5 Million restriction sites/tags in real time checking thesampled
yield distributions versus the knownpopulation
of restriction sites for samples, investigatng coverage depth, and fragment distribution before considering SNP discovery.The perl script sanity checks restriction fragments (probably unnecessary), and summarises sites in the following form;