JianiC / betaCoV

nextstrain build for Beta coronavirus
0 stars 1 forks source link

COV Ancestral Sequence Reconstruction--test #2

Open JianiC opened 4 years ago

JianiC commented 4 years ago

Test

Data: COV19 (Human) Spike gene

Remove duplicate sequences and sequence with unambiguous characters (For AA mutation, n= 817)

Ancestal sequence Reconstruction

IQtree--> get the ancestral sequence for all of the internal node, and summarize the mutation with more node included
TreeTime (limited by the number of sequences)

Ancestral sequence estimation + Map the mutation to the tree annotated_tree nexus

summary statisticsThe TOTAL tree length is 1.593e-01 and 718 mutations were observed. Of these 718 mutations,

The TERMINAL branch length is 1.059e-01 and 520 mutations were observed. Of these 520 mutations,

Of the 3822 positions in the genome,

log-likelihood difference to Poisson distribution with same mean: -1.326e+02

The ten most homoplasic mutations are: mut multiplicity G1841A 24 G1841R 6 G3371K 6 C13T 3 C52T 3 C870T 3 G906T 3 C1623T 3 C1860T 3 C2367T 3 C2472T 3 G3371T 3 C13Y 2 C60T 2 C65T 2 C80T 2 C84T 2 C86T 2 C96Y 2 C145T 2 C149T 2 C210T 2 C227T 2 G268T 2 T458C 2 G542T 2 G663T 2 G773T 2 G922T 2 G925C 2

The ten most homoplasic mutation on terminal branches are: mut multiplicity G1841A 16 G1841R 6 G3371K 6 C52T 3 C870T 3 C13Y 2 C96Y 2 C210T 2 C227T 2 G906T 2 G922T 2 G925C 2 G1356T 2 A1493M 2 C1524Y 2 G1723T 2 G1864A 2 C2073T 2 C2076T 2 A2108G 2 C2147T 2 G2306T 2 C2367T 2 C2472Y 2 C2820T 2 G3366T 2 G3371T 2 C3444T 2 A3498G 2 G3573T 2

Map AA mutation (simply translate the nucleotide to Amino acid)

aa_annotated_tree nexus

summary statistics The TOTAL tree length is 1.593e-01 and 424 mutations were observed. Of these 424 mutations,

The TERMINAL branch length is 1.059e-01 and 302 mutations were observed. Of these 302 mutations,

Of the 1273 positions in the genome,

log-likelihood difference to Poisson distribution with same mean: -1.134e+02

The ten most homoplasic mutations are: mut multiplicity G614D 24 G614X 6 G1124X 6 L5F 3 L18F 3 G1124V 3 L5X 2 T22I 2 P26X 2 A27V 2 T29I 2 H49Y 2 S50L 2 L54F 2 T76I 2 V90F 2 M153T 2 M177I 2 G181V 2 W258L 2 E281X 2 V308L 2 E309Q 2 V367F 2 G476S 2 Q498X 2 A522S 2 A575S 2 E583X 2 V622I 2

The ten most homoplasic mutation on terminal branches are: mut multiplicity G614D 16 G614X 6 G1124X 6 L18F 3 L5X 2 P26X 2 T76I 2 M177I 2 E281X 2 V308L 2 E309Q 2 Q498X 2 A575S 2 E583X 2 V622I 2 T716I 2 G769V 2 G1124V 2 V1228L 2 P1263X 2

Taxons that carry positions that mutated elsewhere in the tree: taxon name #of homoplasic mutations hCoV-19/Norway/1455/2020|EPI_ISL_420135|2020-02-28 2 hCoV-19/Austria/CeMM0009/2020|EPI_ISL_419662|2020-03-14 2 hCoV-19/Malaysia/MKAK-CL-2020-5049/2020|EPI_ISL_416884|2020-02-20 2 hCoV-19/Wales/PHWC-26CB5/2020|EPI_ISL_422113|2020-03-28 2 hCoV-19/Wales/PHWC-26E46/2020|EPI_ISL_422075|2020-03-29 2 hCoV-19/USA/CT_9849/2020|EPI_ISL_426416|2020-03-08 2 hCoV-19/Australia/NSW207/2020|EPI_ISL_427807|2020-03-23 1 hCoV-19/England/200960041/2020|EPI_ISL_414008|2020-02-27 1 hCoV-19/Tianmen/HBCDC-HB-07/2020|EPI_ISL_412983|2020-02-08 1 hCoV-19/England/CAMB-73A19/2020|EPI_ISL_425267|2020-04-01 1 hCoV-19/Nanchang/JX90/2020|EPI_ISL_421245|2020-01-25 1 hCoV-19/England/20139047504/2020|EPI_ISL_420694|2020-03-28 1 hCoV-19/South_11_Korea/SNU01/2020|EPI_ISL_411929|2020-01 1 hCoV-19/USA/WI-91/2020|EPI_ISL_425148|2020-04-01 1 hCoV-19/Japan/DP0654/2020|EPI_ISL_416613|2020-02-17 1 hCoV-19/Canada/ON_PHL3476/2020|EPI_ISL_418380|2020 1 hCoV-19/Japan/DP0457/2020|EPI_ISL_416601|2020-02-16 1 hCoV-19/England/CAMB-74650/2020|EPI_ISL_425319|2020-03-28 1 hCoV-19/Japan/DP0287/2020|EPI_ISL_416589|2020-02-16 1 hCoV-19/Wales/PHWC-26A51/2020|EPI_ISL_422051|2020-03-30 1 hCoV-19/Germany/NRW-04/2020|EPI_ISL_414499|2020-02-26 1 hCoV-19/Beijing/Wuhan_IME-BJ07/2020|EPI_ISL_424360|2020-01-29 1 hCoV-19/Australia/VIC262/2020|EPI_ISL_419955|2020-03-21 1 hCoV-19/Wales/PHWC-24525/2020|EPI_ISL_419424|2020-03-18 1 hCoV-19/India/1621/2020|EPI_ISL_421671|2020-03-12 1 hCoV-19/Australia/VIC313/2020|EPI_ISL_420004|2020-03-24 1 hCoV-19/Australia/VIC171/2020|EPI_ISL_419868|2020-03-18 1 hCoV-19/Australia/NSW93/2020|EPI_ISL_427681|2020-03-19 1 hCoV-19/England/20144061104/2020|EPI_ISL_423550|2020-03-31 1 hCoV-19/Australia/VIC08/2020|EPI_ISL_416514|2020-03-15 1