cov-lineages / pango-designation

Repository for suggesting new lineages that should be added to the current scheme
Other
1.04k stars 97 forks source link

LINEAGE DETERMINATION ISSUE #1770

Closed noutin2020 closed 1 year ago

noutin2020 commented 1 year ago

After analysing the Whole Genome Sequence of a positive SARS-CoV-2 sample collected on 31/01/2023, I determined the lineage of the strain using both Nexclade and Pangolin The Output of Nexclade was: XBB.1.

The Output of Pangolin was: XBB.1.5(1/1) with the following note: scorpio lineage BA.2 conflicts with inference lineage XBB.1.5,4.2,v1.18.1.1

How can I interpret this note and make final decision on the lineage of this variant?

Your various responses will be highly appreciated! I look forward to hearing from you.

Regards, Noutin

carlottaolivero commented 1 year ago

Hi @noutin2020,

I had the same issue and @AngieHinrichs helped me with this: https://github.com/cov-lineages/pangolin/issues/508#issuecomment-1432180430

Hope this helps!!

corneliusroemer commented 1 year ago

Hi @noutin2020, it can happen that Usher and Nextclade show somewhat different results. In this case, the difference between XBB.1 and XBB.1.5 is small and depends on just a single to a few nucleotide substitutions.

When the sequence is incomplete (having some Ns), it is possible that the exact lineage cannot be determined with certainty. Likewise, sometimes there are bioinformatic analysis artefacts that can make it difficult for the algorithms to be certain about which exact lineage it is.

If you share the sequence I could have a look and see why the two tools disagree (slightly).

Based on the sampling date and location you provided, it is quite likely that the sample is XBB.1.5 (or another XBB with S:F486P), see covSpectrum graphic showing proportion of XBB* that is XBB.1.5 in Africa in the past 3 months:

image https://cov-spectrum.org/explore/Africa/AllSamples/Past3M/variants?nextcladePangoLineage=XBB*&nextcladePangoLineage1=XBB.1.5*&analysisMode=CompareToBaseline&

noutin2020 commented 1 year ago

Hi @noutin2020, it can happen that Usher and Nextclade show somewhat different results. In this case, the difference between XBB.1 and XBB.1.5 is small and depends on just a single to a few nucleotide substitutions.

When the sequence is incomplete (having some Ns), it is possible that the exact lineage cannot be determined with certainty. Likewise, sometimes there are bioinformatic analysis artefacts that can make it difficult for the algorithms to be certain about which exact lineage it is.

If you share the sequence I could have a look and see why the two tools disagree (slightly).

Based on the sampling date and location you provided, it is quite likely that the sample is XBB.1.5 (or another XBB with S:F486P), see covSpectrum graphic showing proportion of XBB* that is XBB.1.5 in Africa in the past 3 months:

image https://cov-spectrum.org/explore/Africa/AllSamples/Past3M/variants?nextcladePangoLineage=XBB*&nextcladePangoLineage1=XBB.1.5*&analysisMode=CompareToBaseline&

Hello @corneliusroemer, Thank you so much for your clarifications. The outputs of Usher and Nexclade are the same. This sequence was deposited in GISAID and the accession number is EPI_ISL_17182746. The genome of this strain contains only 9 missing nucleotides and its length is 29491. Should this be considered as incomplete genome? I do not think so because the strain EPI_ISL_172407715 contains 604 Ns and was classified as XBB.1.5.

In addition, it has S:F486P mutation. https://www.epicov.org/epi3/frontend#1903dc

Since you mentioned that it could be another XBB with S:F486P, which one could it be? Below is the accession number of the sequence as requested: EPI_ISL_17182746

I look forward to hearing from you.

image

FedeGueli commented 1 year ago

Hi @noutin2020, it can happen that Usher and Nextclade show somewhat different results. In this case, the difference between XBB.1 and XBB.1.5 is small and depends on just a single to a few nucleotide substitutions. When the sequence is incomplete (having some Ns), it is possible that the exact lineage cannot be determined with certainty. Likewise, sometimes there are bioinformatic analysis artefacts that can make it difficult for the algorithms to be certain about which exact lineage it is. If you share the sequence I could have a look and see why the two tools disagree (slightly). Based on the sampling date and location you provided, it is quite likely that the sample is XBB.1.5 (or another XBB with S:F486P), see covSpectrum graphic showing proportion of XBB that is XBB.1.5 in Africa in the past 3 months: image https://cov-spectrum.org/explore/Africa/AllSamples/Past3M/variants?nextcladePangoLineage=XBB&nextcladePangoLineage1=XBB.1.5*&analysisMode=CompareToBaseline&

Hello @corneliusroemer, Thank you so much for your clarifications. The outputs of Usher and Nexclade are the same. This sequence was deposited in GISAID and the accession number is EPI_ISL_17182746. The genome of this strain contains only 9 missing nucleotides and its length is 29491. Should this be considered as incomplete genome? I do not think so because the strain EPI_ISL_172407715 contains 604 Ns and was classified as XBB.1.5.

In addition, it has S:F486P mutation. https://www.epicov.org/epi3/frontend#1903dc

Since you mentioned that it could be another XBB with S:F486P, which one could it be? Below is the accession number of the sequence as requested: EPI_ISL_17182746

I look forward to hearing from you.

image

Hi the one on left is XBB.1.5 , the other on the right is part of a fast undesignated lineage that emerged in africa proposed in #1712

corneliusroemer commented 1 year ago

It's an XBB.1 with S:F486P - but without the T17124C usually found in XBB.1.5 (though not in 100%). So it could be XBB.1.5, but could also be an independent acquisition of S:F486P.

Nextclade calls it XBB.1.5 because it does fit the broad definition: XBB.1 + S:F486P. Though this could be a false positive.

image

Usher finds this in a cluster of XBB.1 with S:F486P that is not labelled XBB.1.5. I'll have to study this part of the tree more closely to see whether this should be XBB.1.5 or is a genuinely independent XBB.1 + S:F486P. There could already be an issue for this.

image

https://next.nextstrain.org/fetch/genome-test.gi.ucsc.edu/trash/ct/subtreeAuspice1_genome_test_5ec3d_5e4250.json

FedeGueli commented 1 year ago

@corneliusroemer it is part of #1712

(i strongly support designation of it, second fastest in the world after XBB.1.16)

noutin2020 commented 1 year ago

Very insightful discussion and thank you so much for your time.

@corneliusroemer , it is not a false positive because fresh sample with very good ct value. We have other similar results.

Independent acquisition of S:F486P?? I am curious, shed more light on this statement, please.

@FedeGueli, about your strong recommendation: (i strongly support designation of it, second fastest in the world after XBB.1.16)

Is it a general recommendation or specific to this sequence?

How can this designation be done?

FedeGueli commented 1 year ago

@FedeGueli, about your strong recommendation Is it a general recommendation or specific to this sequence?

How can this designation be done?

It was a message for Cormelius, only the pango team could do designations, and designations are always about a lineage not a singlet sequence as stated in #1 .

By the way now it has been designated XBB.1.17.1 so your sequence on the right belongs to this lineage.

noutin2020 commented 1 year ago

Thank you for concluding nicely this discussion.

Let's keep in touch!

On Sun, Mar 19, 2023, 14:41 Federico Gueli @.***> wrote:

@FedeGueli https://github.com/FedeGueli, about your strong recommendation Is it a general recommendation or specific to this sequence?

How can this designation be done?

It was a message for Cormelius, only the pango team could do designations, and designations are always about a lineage not a singlet sequence as stated in #1 https://github.com/cov-lineages/pango-designation/issues/1 .

By the way now it has been designated XBB.1.17.1 so your sequence on the right belongs to this lineage.

— Reply to this email directly, view it on GitHub https://github.com/cov-lineages/pango-designation/issues/1770#issuecomment-1475279612, or unsubscribe https://github.com/notifications/unsubscribe-auth/AQQAFAKN4QFYMBVREEXHTLDW44LJJANCNFSM6AAAAAAV6OOUFY . You are receiving this because you were mentioned.Message ID: @.***>

AnonymousUserUse commented 1 year ago

@noutin2020 Here "false positive" means, Nextclade assigns this sequence as XBB.1.5 but it does not real XBB.1.5. It refers to the algorithm of assigning PANGO lineage, not to the PCR test.

noutin2020 commented 1 year ago

Ok. Rather sequencing run (not PCR). Thanks