steineggerlab / foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.
https://foldseek.com
GNU General Public License v3.0

Download pre-generated afdb.tar.gz #202

Closed by LTEnjoy 11 months ago

LTEnjoy commented 11 months ago

Hi!

Thank you for providing these pre-generated databases. How can I download the largest afdb database? I found that foldseek doesn't support it when I tried the foldseek databases command.

Thank you in advance!

milot-mirdita commented 11 months ago

The full afdb is now part of the afdb50 download too. The afdb50 can be searched in clustered search mode, where it expands the search results based on the found cluster representative, or you can use the afdb50_seq database, which contains the full afdb (needed for cluster expansion).

Sorry for the confusing databases. We don’t want to store the afdb twice, since we pay per month per GB of cloud storage.
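
For readers who want to script this, here is a minimal sketch of what the download and a clustered search could look like when driven from Python. It only prints the command lines unless dry_run is turned off. The output prefix afdb50, the file names query.pdb and result.m8, and the tmp directory are placeholders, not names from this thread, and the use of --cluster-search 1 with easy-search is an assumption about how cluster expansion is switched on, so check foldseek's help output for your version.

```python
import shlex
import subprocess


def afdb50_commands(db="afdb50", query="query.pdb", out="result.m8", tmp="tmp"):
    """Build foldseek command lines for downloading and searching afdb50.

    All file names here are hypothetical placeholders.
    """
    return [
        # Download the clustered AlphaFold database under the prefix `db`.
        ["foldseek", "databases", "Alphafold/UniProt50", db, tmp],
        # Search the cluster representatives, then expand hits to cluster
        # members (assumption: --cluster-search 1 enables the expansion).
        ["foldseek", "easy-search", query, db, out, tmp, "--cluster-search", "1"],
    ]


def run(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run(afdb50_commands())
```

Keeping dry_run=True by default is deliberate: the download is very large, so it seems safer to inspect the commands before running them.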

LTEnjoy commented 11 months ago

Thanks for the answer! Previously I asked how to recover all 3Di sequences from a pre-generated database. Can I recover all 3Di sequences in the UniProt database given the afdb50? If so, which commands should I use?

Looking forward to your reply and thanks in advance!

milot-mirdita commented 11 months ago

Yes, you can use afdb50_seq_ss, analogous to the commands in the other thread, to create a FASTA file with 3Di sequences.
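
The commands from the other thread are not quoted here, so the following is only a sketch of the recipe as it is commonly described: link the header database of the sequence database to the _ss database with lndb, then export the _ss database with convert2fasta. The prefix afdb50_seq assumes the download was stored under the name afdb50, and the output name afdb50_3di.fasta is a hypothetical placeholder.

```python
import shlex


def three_di_fasta_commands(seq_db="afdb50_seq", fasta="afdb50_3di.fasta"):
    """Build foldseek command lines that export 3Di sequences as FASTA.

    Assumes the 3Di database sits next to `seq_db` with an `_ss` suffix.
    """
    ss_db = seq_db + "_ss"
    return [
        # Give the 3Di database a header database by linking the existing one.
        ["foldseek", "lndb", seq_db + "_h", ss_db + "_h"],
        # Write the 3Di sequences, with those headers, to a FASTA file.
        ["foldseek", "convert2fasta", ss_db, fasta],
    ]


if __name__ == "__main__":
    # Print the commands for inspection instead of executing them.
    for cmd in three_di_fasta_commands():
        print(shlex.join(cmd))
```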

LTEnjoy commented 11 months ago

OK, I will try it out. Thank you very much!