jonathanking / sidechainnet

An all-atom protein structure dataset for machine learning.
BSD 3-Clause "New" or "Revised" License
330 stars 38 forks source link

There are some wrong PDB IDs!!! #21

Closed xiongzhp closed 3 years ago

xiongzhp commented 3 years ago

There are some wrong PDB IDs like '5DI3_d5di3a1', '5AHS_d5ahsf1', '3G5G_d3g5gl-' with repeating string.

jonathanking commented 3 years ago

These are ASTRAL IDs, not strictly PDB IDs. See https://github.com/aqlaboratory/proteinnet/issues/1#issuecomment-375270286 for more information.