hartleys / QoRTs

Quality of RNA-Seq Toolset
52 stars 14 forks source link

Hisat2 support #44

Closed ctxchris closed 7 years ago

ctxchris commented 7 years ago

Hi,

Hisat2 has become a popular RNAseq aligner and shows (at least in my experience) slightly more consistent results as STAR while also being faster. The encoding for multi-mapping reads seems to be non-standard, however, and I either get about 99% "DROPPED_NOT_UNIQUE_ALIGNMENT"reads or 0% when I use --keepMultiMapped.

Could you add support for Hisat2?

Thanks, Chris

hartleys commented 7 years ago

There is unfortunately no "standard" way to mark non-unique-aligned reads. Almost all the aligners that I am aware of use the MAPQ column to indicate multi mapping in some way, but every aligner uses its own coding. STAR marks all unique-aligned reads as 255. Tophat uses 50.

I haven't actually used HISAT2, but according to some docs I found online it looks like HISAT2 (v2.0.4 and above) uses a MAPQ of 60 to indicate unique mapping status.

So just add the parameter "--minMAPQ 60".

Let me know if this works, and I'll update the docs with a note.

ctxchris commented 7 years ago

Seems to work fine.

Thanks, Chris