Closed nick-youngblut closed 3 years ago
Hello, it sounds like there is one issue and one feature request here. The issue is that TIPP isn't working on FASTQ-formatted files (it throws a BLAST error). The feature request is that TIPP accept gzip'ed files (either FASTA or FASTQ) as input. Is this correct? For the issue, could you please send a small example file? Thank you!
The bug is that the TIPP documentation states that it can accept fastq files as input, but it appears that it can't. Any Illumina fastq file from the ENA or SRA could be used for testing that.
The other issue is that TIPP seems to require a specific naming of the sequence headers, but TIPP does not edit the header IDs, and instead just throws errors if the headers are formatted incorrectly. The fastq file(s) used for Issue 1 can be used for this 2nd issue.
As you note, this is a follow-up on the issue about what file formats are allowed as input. We now clarify in the tutorial that "The input fragment files must be in a format accepted by BLAST (i.e. a decompressed FASTA file with no spaces in the read names)."
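For anyone hitting the same errors, here is a minimal sketch of a pre-processing step that produces the kind of file described above: a decompressed FASTA with no spaces in the read names. It is not part of TIPP. The function names `open_maybe_gzip` and `fastq_to_fasta` are made up for illustration, the code assumes the common four-lines-per-record FASTQ layout (as in typical Illumina files), and replacing whitespace with underscores is just one assumed way to satisfy the "no spaces" requirement.

```python
import gzip
from pathlib import Path


def open_maybe_gzip(path):
    """Open a plain or gzip-compressed text file for reading (hypothetical helper)."""
    path = Path(path)
    if path.suffix == ".gz":
        return gzip.open(path, "rt")
    return open(path, "rt")


def fastq_to_fasta(fastq_path, fasta_path):
    """Write an uncompressed FASTA copy of a four-line-per-record FASTQ file.

    Whitespace in each read name is replaced with underscores (an assumed
    convention, not something TIPP prescribes). Returns the record count.
    """
    count = 0
    with open_maybe_gzip(fastq_path) as src, open(fasta_path, "w") as dst:
        while True:
            header = src.readline()
            if not header:
                break  # end of file
            if not header.strip():
                continue  # skip stray blank lines between or after records
            seq = src.readline()
            src.readline()  # '+' separator line, not needed for FASTA
            src.readline()  # quality line, not needed for FASTA
            if not header.startswith("@"):
                raise ValueError(f"unexpected FASTQ header: {header!r}")
            # Drop the leading '@' and join whitespace-separated fields.
            name = "_".join(header[1:].split())
            dst.write(f">{name}\n{seq.strip()}\n")
            count += 1
    return count
```

Calling `fastq_to_fasta("reads.fastq.gz", "reads.fas")` would write `reads.fas`, which can then be given to `run_abundance.py` in place of the original FASTQ. Multi-line FASTQ records are not handled by this sketch.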
I'm running run_abundance.py in tipp2, and the tipp2 tutorial states:

However, if I run run_abundance.py with my read file in fastq format (reads.fq or reads.fastq), I get the following:

If I convert those reads to a fasta (reads.fas), run_abundance.py completes successfully. It appears that run_abundance.py can't actually use a fastq.

It would be nice if TIPP2 could read in gzip'ed files, given that many/most users keep reads compressed.