How is the PDB100 database prepared?

steineggerlab / foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.

GNU General Public License v3.0

If I understand correctly, the PDB100 database is built from the PDB clustered at 100% sequence identity. I checked the pdb.lookup file, which supposedly contains all the pdb_chain IDs, and found some unexpected chain IDs such as 1a0n_MODEL_1_B, 1a0n_MODEL_2_B and 1a0n_MODEL_3_B. I could not find chains with these names in the PDB entry 1a0n. What is the difference between these 1a0n_MODEL_*_B chains?
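To make the pattern concrete, here is a minimal sketch of how IDs like these could be split and grouped. It assumes the naming scheme is `<entry>_MODEL_<number>_<chain>`, or `<entry>_<chain>` when there is no model tag. That scheme is inferred only from the three IDs quoted above and is not confirmed by the Foldseek documentation. The helper names `parse_id` and `group_models` and the ID `1abc_A` are hypothetical.

```python
import re
from collections import defaultdict

# Assumed pattern, inferred only from the IDs quoted in this issue:
# <entry>_MODEL_<number>_<chain>, or <entry>_<chain> without a model tag.
ID_PATTERN = re.compile(
    r"^(?P<entry>[^_]+)(?:_MODEL_(?P<model>\d+))?_(?P<chain>.+)$"
)


def parse_id(name):
    """Split an ID into (entry, model, chain); model is None if absent."""
    match = ID_PATTERN.match(name)
    if match is None:
        return None
    model = int(match.group("model")) if match.group("model") else None
    return match.group("entry"), model, match.group("chain")


def group_models(names):
    """Map (entry, chain) to the sorted model numbers seen for it."""
    groups = defaultdict(list)
    for name in names:
        parsed = parse_id(name)
        if parsed is not None and parsed[1] is not None:
            entry, model, chain = parsed
            groups[(entry, chain)].append(model)
    return {key: sorted(models) for key, models in groups.items()}


# The first three IDs come from pdb.lookup; "1abc_A" is a made-up example
# of an ID without a model tag.
names = ["1a0n_MODEL_1_B", "1a0n_MODEL_2_B", "1a0n_MODEL_3_B", "1abc_A"]
print(group_models(names))
```

Grouped this way, the three IDs all map to the same entry and chain (1a0n, B) and differ only in the model number.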

I would much appreciate it if you could help me with this problem. Many thanks.

How is the PDB100 database prepared? #217