nextstrain / nextclade_data

Datasets for https://github.com/nextstrain/nextclade
https://clades.nextstrain.org
32 stars 28 forks source link

Wrong assignment for 20A NCOV sequences #229

Open HadrienRegue opened 2 months ago

HadrienRegue commented 2 months ago

Dear nextstrain team,

First, thank you for your work!

We recently updatated our nextclade sars-cov-2 classic dataset (from 2024-06-13T23:42:47Z to 2024-07-17T12:57:03Z). however, our last sequencing positive control results are wrong using this dataset where 20A->20C. Our 20H control remain correctly assigned.

When using the mature protein sars-cov-2 dataset, our 20A positive control is correctly assigned.

Can you have a look on this problem? I can provide you our postive control sequences if needed.

Best regards,

Hadrien

corneliusroemer commented 2 months ago

Thanks for the report Hadrien!

I'll have a look - could you possibly send me the fasta of the sequence that changed in its assignment? that would simplify the checking.

You can also send it by email if you prefer that over sharing here (as txt)

On Thu, Sep 12, 2024, 12:35 Hadrien Regue @.***> wrote:

Dear nextstrain team,

First, thank you for your work!

We recently updatated our nextclade sars-cov-2 classic dataset (from 2024-06-13T23:42:47Z to 2024-07-17T12:57:03Z). however, our last sequencing positive control results are wrong using this dataset where 20A->20C. Our 20H control remain correctly assigned.

When using the mature protein sars-cov-2 dataset, our 20A positive control is correctly assigned.

Can you have a look on this problem? I can provide you our postive control sequences if needed.

Best regards,

Hadrien

— Reply to this email directly, view it on GitHub https://github.com/nextstrain/nextclade_data/issues/229, or unsubscribe https://github.com/notifications/unsubscribe-auth/AF77AQNEKFOROSXBJQRS2V3ZWFU7ZAVCNFSM6AAAAABOC5IZICVHI2DSMVQWIX3LMV43ASLTON2WKOZSGUZDEMBQHAYDQOA . You are receiving this because you are subscribed to this thread.Message ID: @.***>

HadrienRegue commented 2 months ago

Bellow are the 3 sequences generated from our last sequencing results.

Tpos_SARS-CoV-2.txt

Thank you,

Hadrien