Hi Alex,
I used STARsolo (version=2.7.10a) to align scRNA-seq reads to hg38 genome.
$STAR --runThreadN $thread \
--genomeDir STAR \
--readFilesIn raw/${sample}_2.fastq.gz raw/${sample}_1.fastq.gz \
--readFilesCommand zcat \
--outFileNamePrefix mapping/${sample}. \
--outSAMtype BAM SortedByCoordinate \
--outSAMattributes NH HI AS NM nM MD GX GN CR CY UR UY CB UB sS sQ sM \
--soloType CB_UMI_Simple --soloCBwhitelist None \
--soloCBstart 1 --soloCBlen 20 --soloUMIstart 21 --soloUMIlen 8 \
--soloStrand Reverse --soloFeatures Gene --quantMode GeneCounts \
--outFilterScoreMinOverLread 0.3 --outFilterMatchNminOverLread 0.3 \
--soloCellFilter EmptyDrops_CR 3000 0.99 10 45000 90000 300 0.01 20000 0.01 10000
In the output BAM, there are some alignments without GX tag, like:
A01886:446:HVJL7DSX7:3:2505:26467:23249 272 chr1 11868 1 72M * 0 0 CGTTAACTTGCCGTCAGCCTTTTCTTTGACCTCTTCTTTCTGTTCGTGTGTATTTGCTGTCTCTTGG CCCAG FFFFFFF,FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFF NH:i:4 HI:i:3 AS:i:66 NM:i:2 nM:i:2 MD:Z:45A19A6 CR:Z:TGCGCAGGTCCAACTGCTCT CY:Z:IIIIIIIIIIIIIIIIIIII UR:Z:ATGTTGGA UY:Z:FFF,FFFF sS:Z:TGCGCAGGTCCAACTGCTCTATGTTGGA sQ:Z:IIIIIIIIIIIIIIIIIIIIFFF,FFFF sM:i:0 CB:Z:TGCG CAGGTCCAACTGCTCT UB:Z:ATGTTGGA
Additional, there are some alignments with GX tag as GX:Z:-.
Hi Alex, I used STARsolo (version=2.7.10a) to align scRNA-seq reads to hg38 genome.
In the output BAM, there are some alignments without GX tag, like:
A01886:446:HVJL7DSX7:3:2505:26467:23249 272 chr1 11868 1 72M * 0 0 CGTTAACTTGCCGTCAGCCTTTTCTTTGACCTCTTCTTTCTGTTCGTGTGTATTTGCTGTCTCTTGG CCCAG FFFFFFF,FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFF NH:i:4 HI:i:3 AS:i:66 NM:i:2 nM:i:2 MD:Z:45A19A6 CR:Z:TGCGCAGGTCCAACTGCTCT CY:Z:IIIIIIIIIIIIIIIIIIII UR:Z:ATGTTGGA UY:Z:FFF,FFFF sS:Z:TGCGCAGGTCCAACTGCTCTATGTTGGA sQ:Z:IIIIIIIIIIIIIIIIIIIIFFF,FFFF sM:i:0 CB:Z:TGCG CAGGTCCAACTGCTCT UB:Z:ATGTTGGA
Additional, there are some alignments with GX tag asGX:Z:-
.What are the differences between them?
Best Yan