Discrepancy in Output Counts in Genomad

apcamargo / genomad

geNomad: Identification of mobile genetic elements

The summary files should only include sequences classified as viruses (<prefix>_virus_summary.tsv) or plasmids (<prefix>_plasmid_summary.tsv). Sequences not present in the summary were either not classified as viruses or plasmids, or they were classified but didn't pass the post-classification filters. These filters can be disabled by using the --relaxed flag.

The taxonomy file only contains sequences that were assigned to a taxon. Sequences missing from this file did not match any taxonomically-informative markers. If you expected all sequences to match a marker, you can try increasing the search sensitivity (e.g., -s 7), but this will increase execution time and memory usage.
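To see exactly which input sequences are absent from a given output file, one option is to compare the FASTA headers against the names in the TSV. The following is a minimal sketch, not part of geNomad itself. It assumes the output tables have a `seq_name` column and that provirus entries append a `|...` suffix to the contig name; check both against your own files. The helper names (`fasta_ids`, `read_seq_names`, `unreported`) are hypothetical.

```python
import csv
import io


def fasta_ids(fasta_text):
    """Return the set of sequence IDs (first token after '>') in a FASTA string."""
    return {
        line[1:].split()[0]
        for line in fasta_text.splitlines()
        if line.startswith(">") and line[1:].strip()
    }


def read_seq_names(tsv_text, column="seq_name"):
    """Return the set of values in `column` of a tab-separated table.

    Assumption: the table has a header row containing `column`.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return {row[column] for row in reader}


def unreported(fasta_text, *tables):
    """Return input sequence IDs that appear in none of the given tables.

    Assumption: names such as 'contig|provirus_1_100' refer to 'contig',
    so everything after the first '|' is dropped before comparing.
    """
    reported = set()
    for table in tables:
        reported |= {name.split("|")[0] for name in read_seq_names(table)}
    return fasta_ids(fasta_text) - reported
```

With real files, the contents can be passed in with `pathlib.Path(...).read_text()`, for example the input FASTA plus the virus and plasmid summaries. The returned set is the group of sequences that were either not classified or were removed by the post-classification filters.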

Discrepancy in Output Counts in Genomad #119