theosanderson / taxonium

A tool for exploring very large trees in the browser
http://taxonium.org
GNU General Public License v3.0
95 stars 17 forks source link

MatUtils CLI to identify mutation spread in large populations #575

Closed Biophylo2001 closed 5 months ago

Biophylo2001 commented 5 months ago

After making MAT tree file with usher, I am confused on whether to use matUtils extract or summary to find mutations that have occurred or spread frequently/ within large group of population

Maybe I am getting it wrong but is it possible? Or do we have to manually look:dig in through our visualization in Treenome to look for mutations spread in large populations?

theosanderson commented 5 months ago

Sorry you'd have to ask the UShER team about matUtils.

With Taxonium it isn't very possible at the moment to search for mutations that have spread frequently, but if you have a mutation and want to know how many times it has been seen on the tree, possibly with a minimum number of descendants for each node, you can use mutation search.

AngieHinrichs commented 5 months ago

find mutations that have occurred or spread frequently/ within large group of population

I don't know of a way to do this directly in matUtils, but a tool called pyR0 has been published to do just that:

https://www.science.org/doi/10.1126/science.abm1208 https://github.com/broadinstitute/pyro-cov/

Biophylo2001 commented 5 months ago

@AngieHinrichs thank you for letting me know about Pyrocov, but I fail to understand what kind of input should be given to identify mutation spread, more specifically to plot it. It says to load .pt file but how do I make this? I couldn't find any docs for this. What data needs to be loaded for this? If you know this please let me know. I have asked to Pyrocov as well but got no replies,

AngieHinrichs commented 5 months ago

Hi @Sanyukta2001, I have not run pyrocov so I don't know what the .pt file is, sorry.