How to download PDB and Alphafold data

glygener / glygen-issues

Repository for public GlyGen tickets

GNU General Public License v3.0


How to download PDB and Alphafold data #1755

Closed katewarner closed 1 month ago

katewarner commented 1 month ago

Can you please provide instructions on how to download PDB and AlphaFold data for each release? I will then document the process in our data downloads file.

rykahsay commented 1 month ago

@katewarner @jeet-vora @ubhuiyan I guess I have to keep doing this, since the datasets _protein_pdb_map.csv and _protein_xref_alphafolddb.csv must be created first (and these depend on the EBI nt downloads).

Instead, at every release, please create a ticket asking me to update the following downloads, which I handle on my end:

medline
pubchem
pdb
alphafold

katewarner commented 1 month ago

@rykahsay

Okay, no problem. At which point in the release would you like the ticket: when we've started downloading, or when we've completed our downloads and they're ready for processing?

ubhuiyan commented 1 month ago

@rykahsay @katewarner Just a heads-up: I believe I write a ticket for Robel to download the Medline files, as per the Glycan Download Instructions. Would we prefer to consolidate all of these resources into a single ticket? I typically write the ticket while I am downloading Nathan's exports.

katewarner commented 1 month ago

@ubhuiyan Sounds like a good idea. You would have to wait until I've downloaded and QC'd the protein datasets to make the ticket, but I don't see that as a problem as long as the downloads have no issues, since I've usually finished the downloads by the time you're preparing the files for Nathan.

rykahsay commented 1 month ago

Yeah, one ticket is a good idea. The ticket should be created at the beginning of the release.

ubhuiyan commented 1 month ago

Sure thing. I can add this to the documentation.

ubhuiyan commented 1 month ago

I have added this to the documentation. Closing the ticket now.