I would like to assemble the reads that do not map with my genome using trinity, and this output is incompatible with it. This software runs seqtk previously and gives the following error:
Converting input files. (both directions in parallel)CMD: seqtk-trinity seq -A -R 1 <(gunzip -c /data/input/SRR937558_Unmapped_1.fastq.gz) >> left.fa
CMD: seqtk-trinity seq -A -R 2 <(gunzip -c /data/input/SRR937558_Unmapped_2.fastq.gz) >> right.fa
Error, found read_type 1 but expecting read_type 2
And this doesn't happen if I manually change the 0 for 1 and the 1 for 2 (for a subset of 6 sequences). My new fastq:
marta@cyanobacteria:~/Downloads/CASES/Lorenzo_Federico/star_test_single$ head -n4 6seqs_1.fastq
@SRR937558.2 1:N:
AGACACTTCTGGGAAACTAGGTGATATATCTGTTAAAAATGTTAACAGACAGCCTGACCTCTAAGTGTGATTTCTCAAGACTTGATTTATCACCACCAGGA
+
CCCFFFFFHHHHHJJJJJJJJEHIJJJJJJJJJJJJJGJJJIIJJJJJJJJJJJJJJJJJJJJIIGHHGFIIIIJHJGIHHHHGGFFFFFFFEEEDCDDD=
marta@cyanobacteria:~/Downloads/CASES/Lorenzo_Federico/star_test_single$ head -n4 6seqs_2.fastq
@SRR937558.2 2:N:
GTATTAAAATATCTCTGGAAAGTTATTCTAAGAAATATGTTTGGCTCAGAATCCTACATCCTGGTGGTGATAAATCAAGTCTTGAGAAATCACACTTAGAG
+
CBCFFFFFHHHHHJJJJJJJJJIIIJJJJJJJJJJJJJJIJJJJJJJJJJHIJJJJJIJJJJJIIIJGHIIJJJIJJJJEHHFHHHFFFFFDEEEEEDDDD
There is no particular purpose in any characters after the space in the read ID line.
You can safely delete them or rename them to match the expectations of the downstream tools.
Hi STAR team!
I was aligning some reads with the following command :
And I've realized that the output FASTQ of the unmapped reds have 0 and 1 in the "pair member" part of the header instead of 1 and 2. Example header:
My header:
I would like to assemble the reads that do not map with my genome using trinity, and this output is incompatible with it. This software runs seqtk previously and gives the following error:
And this doesn't happen if I manually change the 0 for 1 and the 1 for 2 (for a subset of 6 sequences). My new fastq:
And now the error doesn't occur:
So my question is, is there a reason to use the 0 and 1 instead of 1 and 2? is there an option to change the output fastq format?
Kind regards, Marta.