This adds protein diagrams to gene leads tooltips. These convey sequence-based, functional information for genes.
Previously, only transcripts / gene structures were shown. These depict the layout of exons, introns, UTRs, and CDS. While useful for comparing exon layout between paralogs, or -- with future enhancements -- how assayed RNA sequence data in a sample relates to reference information, the gene structure diagrams in tooltips were somewhat abstract.
Now, protein diagrams appear below the CDS region of spliced transcript diagrams. Each subpart or segment in a protein diagram represents a feature from InterPro, EBI's protein database. The subparts are protein domains, families, active sites, conserved sites, etc. Hovering over a protein subpart shows its name, which is often its biochemical function.
As an example of such diagrams in the scientific literature, consider these diagrams from Figure 1 in Wu et al. 2011 (PMID 22389749) showing gene (pre-mRNA), mature mRNA, and protein diagrams for the gene AGT:
The serpine domain is now shown for the AGT gene, along with a conserved site in that domain.
Coverage: 86.68% (+0.1%) from 86.584% when pulling 683f34a45f57788a4bfcc996d7d8297b12d235a5 on protein-diagrams into 90011448b88d5f255e2d865d3e3acaaf7fbf6af1 on master.
This adds protein diagrams to gene leads tooltips. These convey sequence-based, functional information for genes.
Previously, only transcripts / gene structures were shown. These depict the layout of exons, introns, UTRs, and CDS. While useful for comparing exon layout between paralogs, or -- with future enhancements -- how assayed RNA sequence data in a sample relates to reference information, the gene structure diagrams in tooltips were somewhat abstract.
Now, protein diagrams appear below the CDS region of spliced transcript diagrams. Each subpart or segment in a protein diagram represents a feature from InterPro, EBI's protein database. The subparts are protein domains, families, active sites, conserved sites, etc. Hovering over a protein subpart shows its name, which is often its biochemical function.
As an example of such diagrams in the scientific literature, consider these diagrams from Figure 1 in Wu et al. 2011 (PMID 22389749) showing gene (pre-mRNA), mature mRNA, and protein diagrams for the gene AGT:
The serpine domain is now shown for the AGT gene, along with a conserved site in that domain.
Here's how it looks for ACE2 and AGT:
https://user-images.githubusercontent.com/1334561/209889723-7ad02daa-432f-4407-9a2b-736282a715f9.mov