Open Kaihui-Cheng opened 1 week ago
Dear Kaihui-Cheng: 01: There are 10 pdb ID in 1a62_A, ..., 1bq8_A. If you are so kind to provide a list of all the PDB ID(12.6k filtered proteins) in all your dataset(only PDB ID). Then we( most readers of your paper) can choose the specific PDB to download. 02: In README "we have decided to provide the 100ns simulation data for all proteins for online download". Still, I see no instruction to download the 100ns of all protein. Could you help me about that. Thank you so much and I am looking forward of your reply. Best M
@meatball1982 Hi! Thank you for your valuable suggestions.
git lfs pull
(without specifying --include="{protein_id}/*"
) in step 3.Please let us know if you have any other questions or suggestions.
Make sure you have Git LFS installed:
Navigate to your
DATA_ROOT
and clone the source:Download data with a specific
protein_id
, for example1a62_A
:Merge the split-volume compression into one file and then unzip the
.tar.gz
file:Ok! Now we have the simulation data for
protein_id
. Note: Sufficient storage space is required for the data. For1a62_A
, 33GB is needed for the unzipped files and 24GB for the zipped files.