soedinglab / MMseqs2

MMseqs2: ultra fast and sensitive search and clustering suite
https://mmseqs.com
GNU General Public License v3.0
1.38k stars 195 forks source link

create phylogeny from clustering structures #683

Open avilella opened 1 year ago

avilella commented 1 year ago

is there a way to create a phylogeny from clustering structures rather than sequences?

I've successfully run:

foldseek createdb example/ db
foldseek search db db aln tmpFolder -c 0.8 
foldseek clust db aln clu
foldseek createtsv db db clu clu.tsv

But I would like to obtain a phylogeny output rather than the tsv output file. Is this possible? Thx

martin-steinegger commented 1 year ago

To utilize the taxonomy features in MMseqs2/Foldseek, you must first label the sequences in your database with taxonomy IDs. You can find detailed instructions on how to do this in our user guide wiki: https://github.com/soedinglab/MMseqs2/wiki#taxonomy-assignment. I recommend posting this question on the FoldSeek GitHub page. :)