glygener / glygen-issues

Repository for public GlyGen tickets
GNU General Public License v3.0

Empty protein_pdb_map.csv datasets #1594

Closed: katewarner closed this issue 2 months ago

katewarner commented 2 months ago

The sarscov2_protein_pdb_map.csv, hcv1a_protein_pdb_map.csv and hcv1b_protein_pdb_map.csv datasets are empty except for text along the lines of:

one of the required files [unreviewed/hcv1a_protein_canonicalsequences.fasta,unreviewed/hcv1a_protein_xref_alphafolddb.csv] does not exist!

These organisms/proteins do have PDB xrefs, so I think they should have pdb_map datasets. However, I checked UniProt and these organisms don't have AlphaFold xrefs, which may be affecting your script. Are we including the AlphaFold mapping in the "protein_pdb_map.csv" datasets?
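
For anyone wanting to check other releases for the same problem, here is a minimal sketch that scans a directory for datasets containing only this placeholder message. It assumes the message always follows the wording quoted above and that the files are available locally. The names `missing_inputs` and `find_placeholder_datasets` and the glob pattern are hypothetical, not part of the GlyGen pipeline.

```python
import re
from pathlib import Path

# Assumption: the placeholder text always matches the message quoted in this issue.
MISSING_INPUT_RE = re.compile(
    r"one of the required files \[(?P<files>[^\]]*)\] does not exist!"
)


def missing_inputs(text: str) -> list[str]:
    """Return the input files named in a placeholder message, or [] if none."""
    match = MISSING_INPUT_RE.search(text)
    if match is None:
        return []
    return [name.strip() for name in match.group("files").split(",") if name.strip()]


def find_placeholder_datasets(
    directory: str, pattern: str = "*_protein_pdb_map.csv"
) -> dict[str, list[str]]:
    """Map each flagged dataset file name to the inputs its message reports missing."""
    flagged = {}
    for path in sorted(Path(directory).glob(pattern)):
        files = missing_inputs(path.read_text(encoding="utf-8", errors="replace"))
        if files:
            flagged[path.name] = files
    return flagged
```

Calling `find_placeholder_datasets("path/to/datasets")` would return something like `{"hcv1a_protein_pdb_map.csv": [...]}` for each affected file, with the missing inputs listed so it is clear which upstream dataset blocked the build.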

rykahsay commented 2 months ago

I have removed them; they shouldn't have been created.

katewarner commented 2 months ago

Okay, many thanks.