jiarong / VirSorter2

customizable pipeline to identify viral sequences from (meta)genomic data
GNU General Public License v2.0
225 stars 31 forks source link

About output results #30

Open maocheng2020 opened 3 years ago

maocheng2020 commented 3 years ago

Hi, I have two questions. My sample includes eukaryotes, will it affect the output result? The phage genome I am concerned about is 50-200kb in size.,Will the screening conditions “--min-length 3000” be too strict? Thanks, good wishes.

jiarong commented 3 years ago

The length cutoff of 3000 is not strict. Generally, the shorter contigs are , the less accurate the virus identification tools are. Anything below 5000, you can expect fair among of false positive in the results, further screen with score and other options or manual check is needed to remove those.