NaegleLab / CoDIAC

Other
0 stars 0 forks source link

Add annotation to map between the Uniprot Reference and Structure Reference and CIF experimental sequence numbers #6

Closed knaegle closed 1 year ago

knaegle commented 1 year ago

Is your feature request related to a problem? Please describe. Develop (from pieces in our dev environment) the check between a Uniprot reference file (CSV) and the structure reference file and the Arpeggio generated contact maps, to find the offsets (if any) in the numbering of files. Additionally, this annotation will denote the regions/domains spanned in a Unprot protein by the structure that was experimentally performed.

Describe the solution you'd like A clear and concise description of what you want to happen.

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.

Tasks

Include specific tasks in the order they need to be done in. Include links to specific lines of code where the task should happen at.

Additional context Add any other context or screenshots about the feature request here.

knaegle commented 1 year ago

Note that instead of keeping an offset between the PDB files and the reference, moving the binary contact adjacency files to use the correct and same numbering as reference.

knaegle commented 1 year ago

@alekhyaa2 are we OK to close this issue because the Arpeggio code you built correctly moves to the reference positions?