santina / team_Undecided

2 stars 0 forks source link

Notes for Monday Feb6th Meeting #2

Open abaghela opened 7 years ago

abaghela commented 7 years ago

Hey people,

I will also try to put down what I remember of Amrits conversation with us.

abaghela commented 7 years ago

So it looks like we want to do networky things...

This is kind of repetitive...

I am not actually sure how to create networks with differentially expressed CpG (DMCs). Seems like they use WGCNA to make their weighted correlation network to make modules. Then they use ingenuity pathway analysis to identify protein-protein interactions of proteins associated with each module. I want to know if we can find the genes associated with each CpG and directly create networks from there. I will research more about this CpG network idea.

Here are the approaches I am thinking. 1) Cluster patients based on their RNA-Seq and methylation profiles. Then, we can see which genes/DMC significantly drove the formations of those clusters. I know a couple tools that can do this. 2) Cluster the patients as above. Would we be able to create networks of the genes/DMCs and see how the networks vary between clustered patients. 3) Create networks with the differentially expressed genes AND the differentially expressed CpGs. We can compare the networks then.

Networks can be created with Network Analyst, which makes protein-protein interactions with genes. Clustering can be done with a whole bunch of tools.

abaghela commented 7 years ago

Notes from Meeting on Feb 6th

Emma

QUESTION? Are there ways to clean up the methylation data?

ppavlidis commented 7 years ago

@abaghela About "I know a couple of tools...": Bear in mind that we want you to do your work in R, for the most part. Using additional (non-R) tools is fine if there's no good alternative (maybe ask first?), but it's then especially important to not treat them as "black boxes": understand how they work (assuming it's not something we teach in class). Thus I don't really recommend using NetworkAnalyst for analysis. If you just need PPI data, maybe it has that, but you can get it from a more primary source like BioGRID and work with in R. Beware that databases like STRING and BioGrid have a lot of types of "interactions" besides PPIs, so you need to filter accordingly.

I may have some comments about the network analysis parts, but I'll wait for you to flesh it out.