Closed lindsay-liang closed 4 years ago
Hi @lindsay-liang. Thanks for your detailed report! It seems that for some reason the script is not reading detecting the read pairs as "properly paired". Would you mind sharing a few lines of your alignment files to explore why this might be the case? Also, what aligner did you use?
Hi @areyesq89, sorry for the very late follow up on this; it turns out that my data was reverse sequenced as well (which I didn't know). Thanks so much for your response!
Hi,
I'm running the dexseq_count.py script on paired bulk RNA-seq data, trying to get read counts on an exon level. The first time I ran the script, I forgot to include the
-p yes
option for paired end sequencing. However when I ran it a second time, including the argument I got extremely low read counts, compared to the first one.Here are the results from DEXseq count for some of the exons without
-p yes
Here are the results from DEXseq count with
-p yes
Here are the read counts from samtools flagstat for each of the exons above:
samtools flagstat ENSG00000236283.5:032.bam
samtools flagstat ENSG00000236283.5:033.bam
samtools flagstat ENSG00000236283.5:034.bam
samtools flagstat ENSG00000236283.5:035.bam
samtools flagstat ENSG00000236283.5:036.bam
samtools flagstat ENSG00000236283.5:037.bam
samtools flagstat ENSG00000236283.5:038.bam
Here are the bedtools intersect results for the same exons:
It looks like the count for aligned R1 reads from samtools are somewhat close to those reported by DEXseq count without ‘-p yes`, but not quite the same. My instinct is just that the reads not counted by dexseq didn't pass the quality threshold. The counts reported by bedtools are the same as the total number of reads aligned reported by samtools.
Any ideas as why we're seeing such low counts with
-p yes
?