jiarong / VirSorter2

customizable pipeline to identify viral sequences from (meta)genomic data
GNU General Public License v2.0
227 stars 31 forks source link

Run virsort2 with multiple genomes #122

Open whitnersyrena opened 2 years ago

whitnersyrena commented 2 years ago

Hi there,

More of a general question, but I can't seem to locate this information. Is it possible to run virsorter2 on multiple genomes at the same time? I have 450 individual WGS fungal strains and wanted to screen them all individually.

Thanks!

jiarong commented 2 years ago

Yes, VS2 treats each sequence independently. You can concatenate the genomes (fasta format) and feed it as input, or run each genome sequence separately in parallel, which will be much faster.

matheuscrgarcia commented 2 years ago

Hello there, just use this command that it will run multiple sequences individually, but not at the same time. Also, this command will create an individual directory for each of the sequences:

for i in *.fna; do virsorter run -w VirSorter2results$i -i $i --jobs 10 --min-length 1000 --keep-original-seq --exclude-lt2gene --rm-tmpdir all; done

Just add the parameters that you like and go for it and do not delete this part 'rm-tmpdir'.