Closed peterjc closed 1 year ago
At this point we've dropped the <STEM>.all_reads.fasta output from the pipeline. The classify step can take either FASTA inputs (legacy) or a sample-tally TSV, but still produces the old-style TSV output.
The next change would be for the classifier to add columns to the sample-tally style TSV, and then for the summary to accept that single file in place of the current pair of files.
This makes it slightly harder to convert the sample-tally TSV output into BIOM format, as the chimera column would have to be dropped.
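For illustration only, dropping a column before a BIOM conversion could look roughly like the sketch below. It assumes a plain tab-separated file with a single header row; the function name drop_columns and the column name "chimeras" are hypothetical, not taken from the pipeline.

```python
import csv
import io


def drop_columns(tally_tsv, unwanted):
    """Return the TSV text with the named columns removed.

    Assumes one header row and tab-separated fields (an assumption,
    not the confirmed sample-tally layout).
    """
    reader = csv.reader(io.StringIO(tally_tsv), delimiter="\t")
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    header = next(reader)
    # Indexes of the columns we want to keep, in their original order.
    keep = [i for i, name in enumerate(header) if name not in unwanted]
    writer.writerow([header[i] for i in keep])
    for row in reader:
        writer.writerow([row[i] for i in keep])
    return out.getvalue()
```

Calling drop_columns(text, {"chimeras"}) would leave only the per-sample counts and any remaining columns, which is closer to the table shape a BIOM converter expects.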
The plan is next to convert the classify step to take the sample-tally TSV output and append taxid and taxonomy columns to it as output, giving a single file for input to the summary command (which would attach any metadata and make the pretty reports).
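A minimal sketch of that idea, under stated assumptions: the tally is a tab-separated file with one header row, the sequence sits in a column called "Sequence", and classify is any callable returning a (taxid, taxonomy) pair for a sequence. The function name, the column names and the callable are all hypothetical here, not the actual classifier API.

```python
import csv
import io


def append_classification(tally_tsv, classify, seq_column="Sequence"):
    """Return the TSV text with taxid and taxonomy columns appended.

    classify is a stand-in for the real classifier: it takes a sequence
    string and returns a (taxid, taxonomy) tuple of strings.
    """
    reader = csv.reader(io.StringIO(tally_tsv), delimiter="\t")
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    header = next(reader)
    seq_index = header.index(seq_column)
    # Existing columns are kept untouched; the new ones go on the end.
    writer.writerow(header + ["taxid", "taxonomy"])
    for row in reader:
        taxid, taxonomy = classify(row[seq_index])
        writer.writerow(row + [taxid, taxonomy])
    return out.getvalue()
```

Appending rather than rewriting keeps the original counts intact, so the summary step would only need to read this one file.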
As an aside, excerpt from running this on our main dataset: