ls /dartfs-hpc/rc/home/w/f006f9w/Raw_data/*R1_001.fastq.gz | while read x; do
# save the file name
base_name=$(basename $x | sed 's/_R1_001\.fastq\.gz//')
echo ${base_name}
# run STAR for each sample
STAR --genomeDir /dartfs-hpc/rc/home/w/f006f9w/Genome_index \
--readFilesIn /dartfs-hpc/rc/home/w/f006f9w/Raw_data/${base_name}_R1_001.fastq.gz /dartfs-hpc/rc/home/w/f006f9w/Raw_data/${base_name}_R2_001.fastq.gz \
--readFilesCommand zcat \
--sjdbGTFfile /dartfs-hpc/rc/home/w/f006f9w/Homo_sapiens.GRCh38.110.gtf \
--runThreadN 16 \
--outSAMtype BAM SortedByCoordinate \
--outFilterType BySJout \
--outFileNamePrefix ${base_name}.
done
I have no errors that come up and am able to generate all the output files.
Then I try to use the picard Collect RNA-seq metrics tool using:
However, my .RNA_metrics file does not have any coding, UTR or intronic bases. It would be great if you could please let me know what the issue is here.
Hello,
I am using the STAR aligner to align bulk RNA-seq samples generated from Smart-Seq-2 protocol using the Nextera XT Dna library prep kit.
This is how I am generating the genome index and aligning my files: Genome Index:
Alignement
I have no errors that come up and am able to generate all the output files.
Then I try to use the picard Collect RNA-seq metrics tool using:
However, my .RNA_metrics file does not have any coding, UTR or intronic bases. It would be great if you could please let me know what the issue is here.
Thank you for the help!