Closed alexmondaini closed 3 years ago
R1 strand error 403
means that the script encountered a read whose flag is unexpected (403), which means that it isn't a primary alignment. Per the eCLIP protocol, only uniquely mapped reads should pass the genome mapping step (STAR --outFilterMultimapNmax 1
)
@byee4 thanks Brian. One curiosity and I close the issue, as per the eCLIP protocol you use samtools view -f 128
to output only the second read in each pair for use with a single stranded peak caller. Why is that ? Why not conserving both reads or taking the first one only ? Thanks a lot.
Hi Alex, good question. I'll have to refer to the original paper which describes the paired-end protocol and is available here: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4887338/
cluster identification was performed on usable reads using CLIPper (available at https://github.com/YeoLab/clipper/releases/tag/1.0) with options –s hg19 –o –bonferroni –superlocal --threshold-method binomial --save-pickle, considering read 2 only (the read that is enriched for termination at the crosslink site). For visualization on the UCSC Genome Browser, all tracks were RPM (reads per million) normalized against the total number of usable reads in that dataset.
Thanks a lot Brian. Actually I'm trying to adapt your workflow for a slightly different scenario we want to experiment. I have to admit that I kept both reads instead of only the second one this time and I got file results from merge_peaks,that seemed to work 👍 .
However when I look at the logs I see R1 strand error 99
and R1 strand error 83
. I'm wondering if /opt/merge_peaks/bin/perl/overlap_peakfi_with_bam.pl
can accept an analysis on both reads instead of only the second one ? And if not can at least the first read be used instead of the second ? or only the second one is the right thing to do ? Thanks.
Sorry, I'm not really sure how to support other use cases, but the R1/2 strand error is simply checking for very specific flags. For eCLIP, we know what kinds of reads and what flags we can expect, so that's what we check for starting on line 308:
I'm getting the following issue
Command is:
My yaml is:
And images used are: