Closed y9c closed 2 years ago
A more specific case, which is not often.
(R1-->) 5'-ATTATTTAGAATATATTTAATTTTAAA-3'
|..|...||||.|||...|||.|.|||
5'-GTCTATGGTCATACCACCCTGAACACACCCAATCTCAAATGAGGGGTGGTTAGTAATTGGATGGGAGGATTTGTCCAGTCC-3'
3'-TTTACTCTCCACCAATCATAAA-5' (<--R2)
||||||| ||||||||||||||
3'-CAGATACCAGTATGGTGGGACTTGTGTGGGTTAGAGTTTACTCCCCACCAATCATTAACCTACCCTCCTAAACAGGTCAGG-5'
There is one C->T mutation on read2, although it will be less mismatch when mark read2 as negative strand (YZ:A:-). But I think it will be more reasonable to mark read2 as positive strand (+) and record one G->A mismatch. Because we have the prior knowledge from R1 and the strandness of library.
Hello @y9c ,
Thank you very much for using HISAT-3N
and your suggestion. I think making directional mapping for strand specific reads is a good idea. I made a new option for HISAT-3N
. HISAT-3N
can make directional mapping with the option --directional-mapping
. This mode should be about 2x faster than standard non-directional mapping, but it only works for strand specific library. Please check the hisat-3n_directional branch for this test version.
Best, Leo
@y9c
I just merge the directional mapping branch to hisat-3n branch. Now HISAT-3N
supports directional-mapping. Thank you for your request. Please check our website for more detail.
Thanks!
Hi @imzhangyun ,
I would like to know if you can add an argument for --directional-mapping
to switch the strand orientation?
Take the Takara SMARTer kit as an example, the R1 in reverse order. This might cause some problem in using this setting.
Thanks!
Hello @y9c,
I wonder if you can exchange the read1.fq and read2.fq?
Best, Leo
Thanks. I mean single end sequencing. I only have read 1
Hi, was this issue resolved? I do not see the reverse mapping as an option on the most updated hisat3n version. Also can't find the dev branch where it is supposed to be.
Thanks!
It seem that hisat2-3n detect the strand of a sequence by the number of conversion, and assign a YZ tag for each read accordingly. This works for most of the situation, such as the example bellow (a bisulfite sequencing data). There are more C->T conversions on read 1, thus it is marked as positive strand (+). There are more G->A conversion on read2, meanwhile read 2 is reverse direction, thus it is also marked as positive strand (+).
But this might cause some problem in this two situations,
We know that most of libraries are constructed in a strand-specific way, eg, RNA bisulfite libraries for detecting m5C modification, DNA libraries constructed by Swift kit etc. p7 adapter are ligated into the 5' end of single stranded RNA/DNA after conversion (say, C->T). Thus, the read1 should have the same mutation (C->T) as the conversion.
I would like to know is it possible to make use of this information and let hisat2-3n output the YZ tag based on the reads direction rather than the conversion number?
Similar issue might be: isssue #346
Thanks!