Open gnilihzeux opened 5 months ago
Hi @gnilihzeux
I would recommend exploring the reads that were mapped by bowtie2 and not mapped by STAR.
@alexdobin Yes, I seemed have found what happened to unmapped reads, of which most are palindrome sequence beween Read1 and Read2.
However, I have not found a solution to this problem yet.
Some sequences are listed as follows
>@illumina:8501:1210 mate1
GAGGCATTTGGCTACCTTAAGAGAGTCATAGTTACTCCCGCCGTTTACCCGCGCTTCATTGAATTTCTTCACTTTG
>@illumina:8501:1210 mate2
CAAAGTGAAGAAATTCAATGAAGCGCGGGTAAACGGCGGGAGTAACTATGACTCTCTTAAGGTAGCCAAATGCCTC
>@illumina:36606:2009 mate1
AGCCGTCCCGGAGCCGGTCGCGGCGCACCGCCGCGGTGGAAATGCGCCCGGCGGCGGCCGGTCGCCGGTCGGGGGACGGTCCCCCGCCGACCCCACCCCCGGCCCCGCCCGCCCACCCCCGCACCCGCCGGAGCCCGCCCCCTCCGGGGA
>@illumina:36606:2009 mate2
GGCCGTGTCGGCGGCCCGGCGGATCTTTCCCGCCCCCCGTTCCTCCCGACCCCTCCACCCGCCCTCCCTTCCCCCGCCGCCCCTCCTCCTCCTCCCCGGAGGGGGCGGGCTCCGGCGGGTGCGGGGGTGGGCGGGCGGGGCCGGGGGTGG
>@illumina:36347:2009 mate1
ATCGGCGAGTGCTGCTGCCGGGGGGGCTGTAACACTCGGGGGGGGTTTCGGTCCCGCCGCCGCCGCCGCCGCCGCCACCGCCGCCGCGAGGGGGGGGGAATCA
>@illumina:36347:2009 mate2
TGATTCCCCCCCCCTCGCGGCGGCGGTGGCGGCGGCGGCGGCGGCGGCGGGACCGAAACCCCCCCCGAGTGTTACAGCCCCCCCGGCAGCAGCACTCGCCGAT
>@illumina:49804:3788 mate1
GTAGTTCACCATCTTTCGGGTCCTAACACGTGCGCTCGTGCTCCACCTCCCCGGCGCGGCGGGCGAGACGGGCCGGTGGTGCGCCCTCGGCGGACTGGAGAGGCATCGGGATCCCACCTCGGGAAGCG
>@illumina:49804:3788 mate2
CAAGGAGTCTAACACGTGCGCGAGTCGGGGGCTCGCACGAAAGCCGCCGTGGCGCAATGAAGGTGAAGGCCGGCGCGCTCGCCGGCCGAGGTGGGATCCCGAGGCCTCTCCAGTCCGCCGAGGGCGCACCACCGGCCCGTCTCGCCCGCC
>@illumina:38521:29680 mate1
GTTTCGGTCCCGCCGCCGCCGCCGCCGCCGCCACCGCCGCCGCCGCCGCCGCCCCGACCCGCGCGCCCTCCCGAGGGAGGACGCGGGGCCGGGGGGCGGAGACGGGGGAGGAGGAGGACGGACGGACGGACGGACGGGGCCCCCCGAGCC
>@illumina:38521:29680 mate2
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
>@illumina:46639:29680 mate1
TACTATTCAAAGTTCTTTTCAACTTTCCCTTACGGTACTTGTTGACTCCC
>@illumina:46639:29680 mate2
GGGAGTCAACAAGTACCGTAAGGGAAAGTTGAAAAGAACTTTGAATAGTA
>@illumina:48673:29712 mate1
CCCATTTAAAGTTTGAGAATAGGTTGAGATCGTTTTCGGCCCCAAGACCTCTAATCNTTCGCTTTACCGGATAAAACTGCGTGGCGGGGGTGCGTCGGGTCTGCGAGAGCGCCAGCTATCCTGAGGGAAACTTCGGAGGGAACCAGCTAC
>@illumina:48673:29712 mate2
GAAACTCTGGTGGAGGTCCGTAGCGGTCCTGACGTGCAAATCGGTCGTCCGACCTGGGTATAGGGGCNAAAGACTAATCGAACCATCTAGTAGCTGGTTCCCTCCGAAGTTTCCCTCAGGATAGCTNGCGCTCTCGCAGACCCGACGCAC
Dear author, There are very high ratio unmapped reads for 'too short' and 'other' while mapping to
T2T chrm13
genome, but it worked for hg19 genome. BWT, there is a 83% reads mapping to T2T with bowtie2. Our data is RNA-seq with ribosome fractions. Our group had modified some parameters related to repeats, including--winAnchorMultimapNmax
higer,--outFilterMultimapNmax
higher,--alignIntronMin 1
. But all tunes didn't work.What parameters should been set?
Thanks a lot.
The logs are follow: T2T
hg19