steineggerlab / foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.
https://foldseek.com
GNU General Public License v3.0
693 stars 91 forks source link

Using github command didn't download full afdb? #280

Open bregman3 opened 1 month ago

bregman3 commented 1 month ago

hello When I use the download databases from the github, the total amount of files that get downloaded data-wise are significantly below the size of what the dataset is supposed to be according to the afdb website. I do not get an error message when following the github steps but the resulting files are only about ~1 TB compared to the website's 23 TiB (~25 TB). Am I doing something wrong?

milot-mirdita commented 1 month ago

That sounds about right. The foldseek AFDB database doesn't contain the full PDB files, but only amino-acid, 3Di and C-alpha backbone information. That is all information that Foldseek uses for searching.

bregman3 commented 4 weeks ago

thank you