greenelab / multi-plier

An unsupervised transfer learning approach for rare disease transcriptomics
BSD 3-Clause "New" or "Revised" License
44 stars 10 forks source link

Pathway separation #52

Closed jaclyn-taroni closed 5 years ago

jaclyn-taroni commented 5 years ago

Closes #40

Here, I'm adding a notebook (32-explore_pathway_separation.Rmd) that looks into the notion of "pathway separation." The logic in CheckPathwaySeparation is the part where another set of :eyes: would be most appreciated/helpful!


I define pathway separation to be: Given two sets of related gene sets (a set of related gene sets would be all the monocyte/macrophage gene sets) that are somewhat similar (e.g., neutrophil set and monocyte/macrophage set), does the model: 1) have at least one latent variable significantly associated with at least one pathway in both sets (e.g. both sets are captured)? 2) have each set uniquely or separately represented (e.g., there exists at least one latent variable that is not significantly associated with both sets of and this is true for both sets)?

To be a bit more concrete: a model has at least one neutrophil-associated LV that is not associated with any of the monocyte/macrophage gene sets and at least one monocyte/macrophage-associated LV that is not associated with any of the neutrophil gene sets.

I'm checking for the separation of three sets of pathway pairs:

Figure, edited from the "raw" PDFs in this PR:

pathway_separation_better_contrast

Summary: