broadinstitute / epi-SHARE-seq-pipeline

Epigenomics Program pipeline to analyze SHARE-seq data.
MIT License

Bam to raw fastq fix #143

Closed by mei-knudson 1 year ago

mei-knudson commented 1 year ago

Changes:

  1. The R3 right-shift check was being done against the exact-match dictionary; it is now done against the mismatch dictionary
  2. Added a QC output to correct_fastq.py that reports the barcode correction type
  3. Added a dictionary of valid correction combinations; barcodes that do not have a valid correction combination are counted as nonmatches
    • Since we know which correction combinations we want to keep, should this dictionary be preset?
    • Should barcodes without valid correction combinations be counted in their own category instead of as nonmatches?
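
To make the two open questions easier to discuss, here is a minimal sketch of the lookup and counting logic described in the changes above. It is not the code in correct_fastq.py: every name (`build_mismatch_dict`, `correct_round`, `correct_read`, `VALID_COMBINATIONS`), the lookup order, and the "at most one corrected round" policy are assumptions made for illustration only.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
CORRECTION_TYPES = ("exact", "mismatch", "lshift", "rshift")

# Placeholder policy (an assumption, not the pipeline's rule): keep a read
# only if at most one of its three barcode rounds needed any correction.
# Building the set once up front is what "preset" would look like.
VALID_COMBINATIONS = {
    combo
    for combo in product(CORRECTION_TYPES, repeat=3)
    if sum(kind != "exact" for kind in combo) <= 1
}


def build_mismatch_dict(whitelist):
    """Map each unambiguous 1-mismatch variant to its whitelisted barcode."""
    owners = {}
    for barcode in whitelist:
        for i, base in product(range(len(barcode)), BASES):
            variant = barcode[:i] + base + barcode[i + 1:]
            if variant not in whitelist:
                owners.setdefault(variant, set()).add(barcode)
    # Drop variants that are one mismatch away from more than one barcode.
    return {v: next(iter(b)) for v, b in owners.items() if len(b) == 1}


def correct_round(read, start, length, exact, mismatch):
    """Return (barcode, correction_type), or (None, "nonmatch")."""
    # Try the expected position first, then one base left, then one base right.
    for offset, shift in ((0, None), (-1, "lshift"), (1, "rshift")):
        pos = start + offset
        if pos < 0 or pos + length > len(read):
            continue
        window = read[pos:pos + length]
        if window in exact:
            return window, shift or "exact"
        # Shifted windows are also checked against the mismatch dictionary.
        if window in mismatch:
            return mismatch[window], shift or "mismatch"
    return None, "nonmatch"


def correct_read(read, starts, length, exact, mismatch, qc):
    """Correct all rounds of one read and tally the outcome in qc."""
    results = [correct_round(read, s, length, exact, mismatch) for s in starts]
    combo = tuple(kind for _, kind in results)
    if combo not in VALID_COMBINATIONS:
        # Invalid combinations share the "nonmatch" bucket here; giving them
        # their own key would be a one-line change.
        qc["nonmatch"] += 1
        return None
    qc[",".join(combo)] += 1
    return tuple(barcode for barcode, _ in results)
```

A usage example with a made-up three-barcode whitelist and a hypothetical read layout (6 bp barcodes at offsets 0, 7, and 14):

```python
whitelist = {"ACGTAC", "TTGCAA", "GATCCG"}
mismatch = build_mismatch_dict(whitelist)
qc = Counter()
# R3 is shifted right by one base and carries one mismatch.
read = "ACGTACGTTGCAAGTGATCAG"
print(correct_read(read, (0, 7, 14), 6, whitelist, mismatch, qc))
print(dict(qc))
```

In this sketch, presetting the dictionary only moves the policy into a constant, and splitting invalid combinations into their own QC category only changes the key passed to the counter, so both questions are about reporting rather than about which reads are kept.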