Identification of protein domain boundaries

choderalab / TargetExplorer

Database framework with RESTful API for aggregating genomic, structural, and functional data for target protein families.

GNU General Public License v2.0

Currently we just take the domain boundaries annotated in UniProt, which are based on PROSITE annotations (sequence pattern and profile searches).

We should look into whether there is a more appropriate procedure.

Analysis of a high-quality multiple sequence alignment (likely built from both sequence and structural data), combined with hierarchical clustering, would probably be a good start.
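To make the alignment-plus-clustering idea a bit more concrete, here is a minimal sketch of what such an analysis might look like. Everything in it is an assumption for illustration: the aligned sequences are made up (not real kinase data), the helper names (`identity_distance`, `average_linkage`, `core_columns`) are hypothetical, and the gap-occupancy rule for picking a shared core is only one possible heuristic, not the project's method.

```python
from itertools import combinations

# Toy, made-up aligned sequences ('-' marks a gap); not real kinase data.
ALIGNMENT = {
    "seqA": "--MKVLGEGAFGKV--",
    "seqB": "-AMKVLGEGSFGKVA-",
    "seqC": "QQLRTIGKGTYGEVWR",
    "seqD": "-QLRTIGRGTYGEVW-",
}


def identity_distance(a, b):
    """1 - fraction of identical residues over columns where both are ungapped."""
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 1.0
    return 1.0 - sum(x == y for x, y in pairs) / len(pairs)


def average_linkage(names, dist, n_clusters):
    """Naive agglomerative clustering: merge the closest pair of clusters
    (average pairwise distance) until n_clusters remain."""
    clusters = [[n] for n in names]

    def cluster_dist(c1, c2):
        total = sum(dist[frozenset((i, j))] for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > n_clusters:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: cluster_dist(clusters[p[0]], clusters[p[1]]),
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # j > i, so index i is unaffected
    return [sorted(c) for c in clusters]


def core_columns(seqs, min_occupancy=1.0):
    """First and last alignment column (0-based) whose ungapped fraction
    is at least min_occupancy, or None if no column qualifies."""
    ok = [
        c for c in range(len(seqs[0]))
        if sum(s[c] != "-" for s in seqs) / len(seqs) >= min_occupancy
    ]
    return (ok[0], ok[-1]) if ok else None


names = list(ALIGNMENT)
dist = {
    frozenset((a, b)): identity_distance(ALIGNMENT[a], ALIGNMENT[b])
    for a, b in combinations(names, 2)
}
clusters = average_linkage(names, dist, n_clusters=2)
core = core_columns(list(ALIGNMENT.values()))
print(clusters, core)
```

For a real alignment, the naive clustering loop would probably be replaced by `scipy.cluster.hierarchy.linkage`, and the occupancy threshold would need tuning per cluster, since different subfamilies may have different boundaries.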

This problem may also be informed by the results of systematic expression tests of kinase construct variants, which are currently in progress.

Identification of protein domain boundaries #2