Closed carlo-berg closed 1 year ago
Ah, the tsv file you've referenced is an old one. For the last year or so Freyja has included these estimates of genome coverage as a standard output. We do still accommodate the old format to ensure backwards compatibility.
It seems like you've run into a parsing problem, possibly as a function of low seq coverage like #166 or some other syntax issue that isn't quite being handled correctly by the default parser.
Any chance you can send a copy of the freyja_agg.tsv file my way? Happy to sort out what's going on. if you'd rather not share it here, my email is jolevy@scripps.edu
Ah, I see. Yes, it has low coverage as its actually a negative sample.
freyja_agg.tsv
:
summarized lineages abundances resid coverage
file1.tsv [('Other', 0.3598697557138268), ('Omicron', 0.19702381428535437), ('CH.1.1* [Omicron (CH.1.1.X)]', 0.1717847147673795), ('XBB.1.5* [Omicron (XBB.1.5.X)]', 0.1717847147673795), ('Epsilon', 0.06249999999989101), ('Delta', 0.03703699999997585)] CH.1.1.27 FT.1 BN.1.4.1 BN.1.4.3 BN.1.4.2 B.1.429.1 B.1.44 AY.77 B.1.1.320 B.1.318 B.1.1.135 N.10 B.1.1.160 B.1.1.284 B.1.1.392 BQ.1.1.50 B.1.1.286 B.1.1.358 C.2.1 AZ.2 AZ.2.1 B.1.633 B.1.1.328 B.1.1.329 0.17178471 0.17178471 0.06250000 0.06250000 0.06250000 0.06250000 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.00952381 0.00952381 0.00952381 0.00952381 0.00952381 0.00952381 0.00952381 0.00321543 0.00321543 0.0792273631555323 0.2809082700732368
file2.tsv [('Other', 0.3598697557138268), ('Omicron', 0.19702381428535437), ('CH.1.1* [Omicron (CH.1.1.X)]', 0.1717847147673795), ('XBB.1.5* [Omicron (XBB.1.5.X)]', 0.1717847147673795), ('Epsilon', 0.06249999999989101), ('Delta', 0.03703699999997585)] CH.1.1.27 FT.1 BN.1.4.1 BN.1.4.3 BN.1.4.2 B.1.429.1 B.1.44 AY.77 B.1.1.320 B.1.318 B.1.1.135 N.10 B.1.1.160 B.1.1.284 B.1.1.392 BQ.1.1.50 B.1.1.286 B.1.1.358 C.2.1 AZ.2 AZ.2.1 B.1.633 B.1.1.328 B.1.1.329 0.17178471 0.17178471 0.06250000 0.06250000 0.06250000 0.06250000 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.03703700 0.00952381 0.00952381 0.00952381 0.00952381 0.00952381 0.00952381 0.00952381 0.00321543 0.00321543 0.0792273631555323 0.2809082700732368
Hi,
if I run
freyja plot
on the aggregated file it gives a key error.First line of aggregated file:
summarized lineages abundances resid coverage
I then looked into the sample aggregated file at https://github.com/andersen-lab/Freyja/blob/main/freyja/data/aggregated_result.tsv and noticed that it does not contain the
coverage
column. So if I remove thecoverage
column from my aggregated filefreyja plot
runs and creates the plot as follows.Versions: