Open flannsmith opened 3 years ago
Hi, some contigs in the boundary file are remove if viral gene % is less than cellular gene %. I do not recommend including them since they are more likely to be false positive hits unless you can verify in other ways.
Those without confidence scores are short contigs with less than 2 complete genes but have hallmark genes. In you case, those hallmark genes are from dsDNAphage group.
@jiarong Just seeing your comment now for some reason. Thanks for your response!
Hi I'm just wondering why there would be less contigs appearing in the final-viral-score file (9808) as opposed to 12113 in the final-viral-boundary file? Can I include with confidence that all seqname's identified in the final-viral-boundary file are viral?
Also there are a number of lines in the viral-score file which are empty or don't include the % of confidence vote for each viral species but are ultimately deemed as dsDNAphage phage. Is that normal or should I filter these out?
Any insight much appreciated! Thanks.