I got the output files from the prediction mode run, but I do not know how to interpret them.
Which file to look at for true variants? SSeq.Classified.sSNV.tsv or SSeq.Classified.sSNV.vcf?
I checked the SSeq.Classified.sSNV.tsv file's "TrueVariant_or_False" column, and it only has "nan" values. How could I fix this?
The number of variants in SSeq.Classified.sSNV.vcf is exactly the same as in consensus.vcf. Why is that? I assume SomaticSeq will filter out the false positive?
SSeq.Classified.sSNV.tsv with SCORE>0.7 (even better if >0.9) or SSeq.Classified.sSNV.vcf with PASS label are considered high-confidence variant calls.
TrueVariant_or_False is usually nan unless it's training data with truth file attached to it. So nothing wrong with it.
By default, every record in the .tsv file is written into the .vcf files for completeness. Filter for "PASS" label for high-confidence calls.
Hi,
I got the output files from the prediction mode run, but I do not know how to interpret them.
Which file to look at for true variants? SSeq.Classified.sSNV.tsv or SSeq.Classified.sSNV.vcf?
I checked the SSeq.Classified.sSNV.tsv file's "TrueVariant_or_False" column, and it only has "nan" values. How could I fix this?
The number of variants in SSeq.Classified.sSNV.vcf is exactly the same as in consensus.vcf. Why is that? I assume SomaticSeq will filter out the false positive?
Thanks!