Closed aarbduarte closed 4 years ago
Hi @aarbduarte
there is another similar problem #975 .
The reads look fine, so it's puzzling. You got the fastq files from SRA, without any postprocessing?
The only thing I can think of is that there is some sort of invisible character that creates an empty right after
@SRR1070260.47317756.1
Can you extract these 3+3 reads to files (not copypaste from the screen), and map them?
Cheers Alex
Hi @aarbduarte
these files map without any problem. So you see the error if you map them? If not, the next test I would recommend is mapping the original files, without trimming - this will test whether the trimming somehow screwed up with fastq files.
Cheers Alex
Hi @alexdobin , thank you for the quick reply.
when running pre-cutadapt files it delivers the same error.
`$STAR --runThreadN 60 \
--genomeDir /home/aduarte/STAR_refe_files \ --readFilesIn ./bams/SRR1070260_1.fastq.gz ./bams/SRR1070260_2.fastq.gz \ --twopassMode Basic \ --outSAMstrandField intronMotif \ --readFilesCommand zcat \ --outSAMtype BAM Unsorted \ --outFileNamePrefix ./bams/SRR1070260. Aug 04 15:14:28 ..... started STAR run Aug 04 15:14:28 ..... loading genome Aug 04 15:16:24 ..... started 1st pass mapping
EXITING because of FATAL ERROR in reads input: short read sequence line: 0 Read Name=@SRR1070260.47317756.1 Read Sequence==== DEF_readNameLengthMax=50000 DEF_readSeqLengthMax=650
Aug 04 15:19:53 ...... FATAL ERROR, exiting
gzip: stdout: Broken pipe`
Thank you for your time
Hi @aarbduarte
weird, not sure what's going on. Which STAR version you are using? Please try the latest 2.7.5b , both the static and dynamic executables, and compiled from source. I will download this sample to see if I can reproduce the problem.
Cheers Alex
Aah, I cannot get this sample, it's dbGapped. So you would need to try different things on your side. A few more suggestions:
Rerun using recommended settings, still delivers the same error
Hi @aarbduarte
this is going to be a hard one to debug... :( A couple of more things to try:
grep -A3 -v ^@SRR1070260.47317756.1 fastq1
grep -A3 -v ^@SRR1070260.47317756.2 fastq2
Cheers Alex
I isolated the reads and they were mapped correctly. Weird issue but solved. Thanks!
Hi,
Thank you for all the effort, STAR is a fantastic tool.
I've been running STAR for 92 paired-end RNA-seq samples and at a particular one I always retrieve the same error:
I've tried to run it gzipped and unzipped. The command I use is
$STAR --runThreadN 60 --genomeDir /home/aduarte/STAR_refe_files --readFilesIn ./bams/SRR1070260_1.fastq ./bams/SRR1070260_2.fastq --twopassMode Basic --outSAMstrandField intronMotif --outSAMtype BAM Unsorted --outFileNamePrefix ./bams/SRR1070260.
(with --readFilesCommand zcat when dealing with .gz files)
the lines in question are:
and
Any idea?
Thank you for your time