ktmeaton / ncov-recombinant

Reproducible workflow for SARS-CoV-2 recombinant sequence detection.
MIT License
18 stars 2 forks source link

Conflicting results between parent_clade and parent_lineage #205

Closed ktmeaton closed 1 year ago

ktmeaton commented 1 year ago

Conflict has been observed between the populations reported between parent_clade and parent_lineage. Example:

Note that the first parent is contested, as the clade is reported to be BA.4, while the most likely lineage is BA.5.2.1*. This conflict is also seen in the first parent's confidence, which is low at 0.28.

To draw attention to this, I could create a new field called parent_conflict. If a parent_lineage is not a descendant of the corresponding parent_lineage, this field could be set to true.

At first, this will just be a field for exploratory purposes. But if it appears to help with identifying false positives, this could be a default filter to excluded samples with parent conflict.