steineggerlab / foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.
https://foldseek.com
GNU General Public License v3.0
780 stars 99 forks source link

Clustering ProstT5 DB? #361

Open HaimAshk opened 14 hours ago

HaimAshk commented 14 hours ago

Dear FoldSeek team,

Sorry for maybe a very novice question, I could not find the answer in the documentation... I have a ProstT5-encoded DB for foldseek. Is there an easy way to cluster all the proteins included in the DB? It seems easy-cluster scheme does not accept a DB as an input (or I missed it). How can I achieve it?

Thanks for any advice! Haim

milot-mirdita commented 14 hours ago

That should work. What did you execute? You might need to use cluster instead of easy-cluster since the latter only accepts structures and not databases as input

HaimAshk commented 14 hours ago

Thanks for the prompt response, Milot! Yes, using just cluster seems to work. Thanks!