Open erikyao opened 2 years ago
There are a few VCF normalization tools/algorithms that can be used in our _normalized_vcf function, e.g.:
_normalized_vcf
Our current implementation fails to meet the Parsimony definition of VCF normalization:
A variant is parsimonious if and only if it is represented in as few nucleotides as possible without an allele of length 0.
E.g. our current implementation cannot handle cases like ref = "TCCCCT", alt = "CCCCT". Normalized sequences should be ref = 'TC', alt = 'C'.
ref = "TCCCCT", alt = "CCCCT"
ref = 'TC', alt = 'C'
There are a few VCF normalization tools/algorithms that can be used in our
_normalized_vcf
function, e.g.:Our current implementation fails to meet the Parsimony definition of VCF normalization:
E.g. our current implementation cannot handle cases like
ref = "TCCCCT", alt = "CCCCT"
. Normalized sequences should beref = 'TC', alt = 'C'
.