Clarification on reference allele normalization policy

ga4gh / vrs

Extensible specification for representing and uniquely identifying biological sequence variation

Apache License 2.0

80 stars 34 forks source link

Hi @Mrinal-Thomas-Epic!

Yes, this was discussed a long time ago. I had also advocated for this approach, and was asked to prove that changes would not impact non-reference alleles. I put in the time to do so, the notebook is here, and the approach is technically sound.

In the end, the argument that @reece made was that when describing reference alleles, the intent of ref-agree calls is different than variant calls, and for ref-agree we should not assume that ambiguity correction is needed or desired. As in all cases where the decision was not obvious, @reece @larrybabb and I presented to the community, assessed the arguments and community feedback, and then documented our rationale for the majority opinion in the spec; in this case, the majority opinion was that ref allele expansion should not be the default normalization behavior.

I wish I had a more satisfying answer. If you have some example data that demonstrates how adjusting the ref-agree behavior in the normalization algorithm would be beneficial to you, I am open to revisiting this decision for VRS 2.x.

ga4gh / vrs

Clarification on reference allele normalization policy #468