Closed timz0605 closed 10 months ago
The zotus.fasta file contains approx. 2000 sequences of a 313 bases long partial COI gene region.
Hi and thanks for reporting this problem.
I agree that the size of the output file seems too big.
Are you sure that vsearch has properly parsed the OTU and sample identifiers from your input files?
It is difficult to pinpoint the what is wrong without more information. If possible, could you provide e.g. the vsearch version and computer platform, the exact sizes of the files, number of sequences, etc.
If possible, extracts with a small portion of the input files and result files (e.g. the first few lines) could be valuable. If you do not want to publish it here, you could alternatively send it to me by email.
No response - closing.
The command I use is
I am processing COI metabarcoding data for ~50 sites. I have finished all the merging, filtering, denoising, etc., steps, and the zotus.fasta file is my ZOTU database for this study. However, I encountered an issue when trying to generate the OTU frequency table
The size of combined.fasta is approximately 500 MB, and the size of the zotus.fasta is about 600 KB. However, the process was taking so long and the size of the zotutable.txt generated was somewhere around 30GB (which obviously does not make sense). I am wondering if you could take a look and see why I might encountered this error.