Thanks for your work. However, as I tried to conduct protein structure clustering, the program always returns clusters based on proteins' substructure. I am wondering how can I get the cluster based on global structures?
--alignment-type 1 seems can get the global results based on TM align. Here is the code that I used:
./foldseek/bin/foldseek easy-cluster ./protein res tmp --lddt-threshold 0.5 --c 1 --alignment-type 1
A workaround for this is to use a tool like pdb-tools and give all your chains the same name (e.g. using pdb_chain -A [PDB FILE]). It doesn't scale well, but it's very doable if you don't have tens of thousands of files to cluster.
Thanks for your work. However, as I tried to conduct protein structure clustering, the program always returns clusters based on proteins' substructure. I am wondering how can I get the cluster based on global structures?
--alignment-type 1 seems can get the global results based on TM align. Here is the code that I used:
./foldseek/bin/foldseek easy-cluster ./protein res tmp --lddt-threshold 0.5 --c 1 --alignment-type 1
However, the results I get look like:
And I would like to get the format of the clusters similar to:
Thanks.