greenelab / core-accessory-interactome

Investigating the functional relationship between P. aeruginosa core and accessory genes.
BSD 3-Clause "New" or "Revised" License
1 stars 1 forks source link

Add more exploratory analysis from discussion with collaborator #44

Closed ajlee21 closed 3 years ago

ajlee21 commented 3 years ago

This PR is adding and updating analyses to continue to explore the data to figure out what set of genes and what story to tell about core and accessory genes.

Changes include:

  1. Exploring accessory genes most related to exoU and exoS genes (3_explore_exo_modules.ipynb)
  2. Update notebooks in 5_core_core_analysis/ directory to output annotations for all core genes, not sure those that are most and least stable
  3. Add KEGG enrichment analysis to identify any pathways that are significantly enriched in our most and least stable gene sets (4_KEGG_enrichment_of_stable_genes.ipynb)
  4. Update common DEG analysis after training two new VAE models using the PAO1 and PA14 RNA-seq compendia defined in this repository. This change is coupled with https://github.com/greenelab/generic-expression-patterns/pull/96. Note: I plan to make another PR to copy the necessary scripts from the generic-expression-patterns repo into this repo to use so that the analysis is all in one place. Ideally it would probably had made sense to have the scripts from the generic-expression-patterns repo be its own portable module, but this is the easiest clean way I can think to do this. (1_common_DEGs_and_gene_groups.ipynb)
nrosed commented 3 years ago

Look good to me! To be honest, I can't give in-depth feedback on your methodology/approach, but from what I understand it seems clear and well-motivated. Only very minor comments.

ajlee21 commented 3 years ago

Thank you for the comments. I know all the analyses are a bit hard to follow at the moment because there isn't a clear story just yet, its still in the works. So thanks for taking a look.