Hi, zhangkai:
I am using SnapATAC2-v2.7.0
Due to experimental issues, I obtained a poor quality dataset where R1 consists entirely of fixed sequences, with R1 mapping accounting for only 6.74% of total R1 reads. R2 contains normal sequences, with R2 mapping accounting for 99.3% of total R2 reads. After performing paired-end alignment using bwa mem, I obtained a BAM file and used the following command:
I noticed that the sum of "frac_confidently_mapped" and "frac_unmapped" exceeds 1, which is clearly unreasonable. Could you please explain how "frac_confidently_mapped" and "frac_unmapped" are calculated respectively?
The definitions of these two metrics are from 10x website:
frac_confidently_mapped: Fraction of sequenced reads or read pairs with mapping quality >= 30, i.e., #q30 / #all reads.
frac_unmapped: Fraction of sequenced reads or read pairs that have a valid barcode but could not be mapped to the genome, i.e., 1 - #mapped barcoded pairs / #total barcoded pairs.
Hi, zhangkai: I am using SnapATAC2-v2.7.0 Due to experimental issues, I obtained a poor quality dataset where R1 consists entirely of fixed sequences, with R1 mapping accounting for only 6.74% of total R1 reads. R2 contains normal sequences, with R2 mapping accounting for 99.3% of total R2 reads. After performing paired-end alignment using bwa mem, I obtained a BAM file and used the following command:
This generated the following QC metrics:
I noticed that the sum of "frac_confidently_mapped" and "frac_unmapped" exceeds 1, which is clearly unreasonable. Could you please explain how "frac_confidently_mapped" and "frac_unmapped" are calculated respectively?
Thank you!