Closed knaegle closed 1 year ago
Unsure where the behavior originates, but for certain on line 69, handling PDB_ID as a dictionary key when more than one annotation entry can be returned is an issue, which would select/allow only one storage item. Will need to generate new keys (e.g. PDBID_ENTITYID) to handle unique entries in a PDB as part of dictionary. Entity_ID is the way we originally handled this.
See attached image and 1D4W as an example where there are two entities and two chains in each entity.
Here's the current report. It's clear that it knows there are two entities, as there are two chain lengths, but there is only one sequence reported.
Description
PDB annotation, via the generateStructureRefFile is returning only one of the unique chains when more than one protein chain is provided.
Screenshots
Desired behavior, example for 2OQ1
Files
To Reproduce
Steps to reproduce the behavior:
Expected behavior
It should produce two lines, one for chain A (uniprot id P43403) and one for Chain B (uniprot id P20963). It currently suggests that both A,B belong to P43403.