griffithlab / civic-client

Web client for CIViC: Clinical Interpretations of Variants in Cancer
MIT License
50 stars 28 forks source link

Create a visualization of mutations relative to the protein sequence, its domains, mutation hotspots, etc. #66

Open malachig opened 9 years ago

malachig commented 9 years ago

Since we have mutations that are point sites or are more generally in certain domains or other large features of the protein. For example, MD Anderson creates these for each gene and they are customized when you click on each mutation: screen shot 2015-03-04 at 3 13 17 pm

jmcmichael commented 9 years ago

At some point I can definitely create a d3 visualization that can generate these in the client, I just need to know what data I will be getting to generate the figure and how it corresponds to the MD Anderson example.

malachig commented 9 years ago

Yeah, I think we should keep this on the wishlist for now. There is actually a somewhat complex mix of data being represented here. From left to right:

The raw material for most of the data needed could be obtained from the Ensembl API (or similar). The variant landscape could come from Cosmic, the ICGC data portal, TCGA, etc.

Zach has been working on a similar visualization concept in R using R modules (Biomart?) to get the raw data and ggplot2 to create the image. If we decide to tackle this at some point we should probably get together a group and brainstorm on some of the specifics.

kkrysiak commented 8 years ago

The GenVisR (Zach's R modules) protein visualization continues to evolve. https://github.com/griffithlab/GenVisR

Alternatively, in many cases we have all the fields to generate a lolliplot via cBioPortal. http://www.cbioportal.org/mutation_mapper.jsp The source code is also available on GitHub under a GNU Affero General Public License V3 https://github.com/cBioPortal/cbioportal

kkrysiak commented 5 years ago

Example of compressed introns vs regular in GenVisR https://bioconductor.org/packages/release/bioc/vignettes/GenVisR/inst/doc/Intro.html#133_gencov_(sequence_coverage_graphic)

kkrysiak commented 5 years ago

Consider embedding protein paint. https://docs.google.com/document/d/1KNx4pVCKd4wgoHI4pjknBRTLrzYp6AL_D-j6MjcQSvQ/edit

kkrysiak commented 5 years ago

From discussions with members of the Melbourne Genomics Health Alliance, this is a priority. Partially because we don't have a nice way to dig through variants that we do have. Aliases only work on the quick search, browse and advanced search pages. On a gene page, unless you are searching for an exact variant, you can't get there.

kkrysiak commented 5 years ago

Everyone loves the COSMIC way of filtering...there is a histogram of events. That sort of thing would work for us as well. I love the lolliplot idea but I don't know about implementation. Shahil's visual (lolliplot that replaced variant occurrences with evidence count) was awesome for this sort of thing. But lolliplots don't really work that well for gene-level events.

malachig commented 4 years ago

We have talked about the idea of collaborating with the ProteinPaint team at St. Judes to resolve this issue. Its not done yet, but it could be a way to resolve this...