Please provide PDB and AlphaFold xref datasets for 2.8
For PDB 3D structures follow the selection criteria below
Always provide AlphaFold structures
Here are the selection criteria for PDB Protein Structures from @jeet-vora and Raja, which you can use for filtering the downloaded PDB files.
Rules:
Length - Select the PDB accessions/structure that contains the longest aa sequence.
Method - The structures resolved through the Xray method should be shortlisted first. If Xray structure is not available NMR structures are to be selected.
Resolution - From the shortlisted Xray structure choose the one with the highest resolution. NMR structure does not have a resolution, so select the NMR structure with the longest sequence.
Number of chains - If two structures have identical 1, 2 and 3 properties, then choose the accession with a lower number of chains.
@jeet-vora It's for Jie but I realised while making it need some more info from Robel on how he wants the datasets, so I'll ask him about it on Monday as it's not a priority
Please provide PDB and AlphaFold xref datasets for 2.8
Here are the selection criteria for PDB Protein Structures from @jeet-vora and Raja, which you can use for filtering the downloaded PDB files.
Rules:
Let me know if you need anymore information.
Example dataset: