nrnb / GoogleSummerOfCode

Main documentation site for NRNB GSoC project ideas and resources
114 stars 38 forks source link

Structural similarity network tool #58

Closed AlexanderPico closed 8 years ago

AlexanderPico commented 8 years ago

Background

Protein structure determination methodologies create sizable databases of X-ray solved structures (http://www.rcsb.org/pdb/). Similarities between these structures are invaluable for understanding of structure and function evolution, of both specific proteins of interest, or members of the same class of proteins. One way to represent these similarities is by structural similarity networks, in which nodes represent protein structures and edges represent pairwise similarities between them. These have proved to be very useful in the past by our lab in demonstrating structure-based grouping of proteins and understanding remote homologies between them. Users can greatly benefit from a standalone software package that inputs structure identifiers and outputs the similarity network, especially if their specific proteins of interest are highlighted in the “structure space” context.

Goal

The goal of this project is to create a python-based software package that inputs structure identifiers, computes pairwise similarities (TM-align/chimera matchmaker), reduce redundancy by grouping structures that are very similar to each other, and finally outputs a structural similarity network with some node metadata retrieved from the RCSB database. Such networks can be visualized using Cytoscape (local installation) or cytoscapejs.

Difficulty level: 1

Technology and Skills

AlexanderPico commented 8 years ago

postponed until mentors are available