Read quality has become much more consistent over time. It is now standard for reads to have very high quality scores along their full length, whereas several years ago quality commonly dropped toward the end of a read. As part of the mapping pipeline I am quality trimming with -q 20, so these differences should not affect overall results. However, it would still be good to look at overall read quality and see how it differs among samples.
Questions and Tasks
[ ] Plot distribution of read quality.
FastQC calculates the distribution of per-base quality scores. Just average the scores within each decile and plot the distribution of those decile averages across samples.
[ ] Do runs within an SRX have similar quality scores?
[ ] Is there a clear cutoff (fraction of bases) that should be implemented?
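A minimal sketch of the decile averaging described in the first task. It assumes "deciles" means ten equal bins of read position and that the per-base mean quality for each sample has already been parsed out of the FastQC output; the function name `decile_means`, the sample names, and the quality values are all hypothetical and for illustration only.

```python
from statistics import mean


def decile_means(per_base_means):
    """Average per-base mean quality within ten bins of read position."""
    n = len(per_base_means)
    if n < 10:
        raise ValueError("need at least 10 positions to form deciles")
    averages = []
    for d in range(10):
        # Integer arithmetic keeps the bins contiguous and non-overlapping.
        start = d * n // 10
        stop = (d + 1) * n // 10
        averages.append(mean(per_base_means[start:stop]))
    return averages


# Made-up per-base mean qualities for two 50 bp samples (not real data):
# sample_B loses quality over its last 10 positions.
samples = {
    "sample_A": [38.0] * 50,
    "sample_B": [38.0] * 40 + [28.0] * 10,
}

# One row of ten decile averages per sample, ready to hand to a plotting library.
table = {name: decile_means(quals) for name, quals in samples.items()}
```

Each row of `table` can then be plotted per decile (for example as a boxplot across samples) to see whether any sample or run stands out.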
I forgot that I removed FastQC from the workflow because it was adding little value. I am going to close this and will re-open it if I determine that FastQC results are essential.