Repo structure:
generate_figures.R :: Script generating the final figures for the paper
data/ :: Directory containing the data and upstream preprocessing scripts
analysis/ :: Directory containing the analyses performed as part of this study
misc/ :: Misc data, helper functions, and scripts
TLDR: This repo is for a research paper that develops 2-3 approaches for addressing one question: "How do cancers relate to normal tissues?" In another sense, this question is asking:
Here are the three aims:
This repository is based on a simple idea:
There is are a discrete number of Normal cellular states which are defined by the expression of specific genes in specific pathways -- and that we can learn these states using RNA-Sequencing. One popular depiction of this idea is the Tabula Muris - article:
From our recent Cancers paper, we showed that at least one cancer (Ewing sarcoma) has a cellular state which is similar to the cellular states found during normal embryonic development. More improtant, we learned how certain interventions change the state of Ewing sarcoma tumor cells. This led us to realize new drug targets which were subsequently demonstrated (albeit not by us...). However, the method we used (PHATE) is more qualitative with respect to answer questions of the normal-cancer relationship. For a larger-scale implementation of this approach, we need more quantitative tools in addition to qualitative ones.
We plan to extend the method we showed previously to all other cancers. Then, we hope to make a web application where users can explore their cancer of interest and see how it relates to normal tissues.
Using a method similar to PLIER, we can incorporate pathway-level information to score cancer samples based on their expression of pathways which are related to particular normal tissues. For example, we might have found that Ewing sarcoma has a score of 50% Mesodermal and 50% pluripotency. How does an intervention change these scores? This would give us an innovative and quantitative way to assess the same questions posed above.
This Dr. Zheng's idea which may provide an interesting supplement to our other analyses -- but he hasn't yet disclosed the details of his approach to me.