NaegleLab / CoDIAC

GNU General Public License v3.0
0 stars 0 forks source link

A missing STAT protein domain in reference files #38

Closed alekhyaa2 closed 9 months ago

alekhyaa2 commented 9 months ago

Is your feature request related to a problem? Please describe. On re-running the scripts to generate reference files for SH2 domains, we found that the STAT proteins include a STAT_linker domain that was missing in the earlier generated files (uniprot reference and PDB reference)

From non-canonical feature set, this STAT_linker domain binds to the SH2 domain and found features for STAT3, STAT1, STAT6, STAT5A, and STAT2.

knaegle commented 9 months ago

Note: this STAT_linker showed up when Alekhya reran integration and reference from integration_bug branch, now see STAT_linker domain in reference and PDB. Will add to this ticket, what changes may have occurred to bring that in for closure.

knaegle commented 9 months ago

I don't have an answer for this, other than that I can replicate that now Interpro fetch includes STAT Linker regions. I suspect it might be that interpro updated and that is now a region returned.