mozack / ubu

UNC-Chapel Hill Bioinformatics Utilities
23 stars 13 forks source link

ubu sam-xlate translate coordinates problem #3

Open guanghaoli opened 6 years ago

guanghaoli commented 6 years ago

When I used sam-xlate to translate the genome coordinates to transcriptome coordinates, the bam file was much bigger. The genome coordinates bam is about 3gb, while the transcriptome coordinate bam is about 20gb. I used samtools flagstat to statistic the reads number and found that the reads number is about 20 times bigger. Could you please tell me why this problem occurs and how to deal with it? genome bam flagstat: image transcriptome bam flagstat: image

mozack commented 6 years ago

Reads that map to a single position can potentially map to mutiple transcripts. All possible transcript mappings are output.