santina / team_Undecided

2 stars 0 forks source link

Initial proposal feedback #1

Open singha53-zz opened 7 years ago

singha53-zz commented 7 years ago
Name Department/Program Experties/Interests GitHub ID
Arjun Baghela Bioinformatics Immunology & Transcriptomics @abaghela
Emma Graham Bioinformatics Machine Learning & Metabolomics @emmagraham
Allison Tai Bioinformatics Machine Learning & DNA Structure @faelicy
Eric Chu Bioinformatics Neuroscience & Transcriptomics @echu113

Team name: Undecided

One paragraph on the basic idea of the project:

Asthma is characterized by chronic inflammation, and affects over 400 million children and adults worldwide (1). The heterogeneity of the disease manifests as variation in clinical onset, responsiveness to treatment and comorbidities (2). Upstream events in the lung epithelial cells of the lower airway have been postulated to initiate Type II inflammation, which is mediated by CD4+ T cells, leading to cytokine production and remodeling of the cellular environment in the lower airway. Recent studies using RNA-seq data have characterized the Type II immune response in CD4+ T cells; however, the upstream events in epithelial cells that initiate this response remain unknown (3,4). A study recently published, which obtained RNA-Seq and methylation profiles for 76 asthma patients, investigated the genetic and epigenetic markers upregulated in lower airway epithelial cells during asthmatic responses (5). However, the conclusions of the study are limited by numerous confounding factors such as medication usage, comorbidities and artefacts of experimentation, which can obscure the detection of meaningful biological signals. Furthermore, the generation of interactive networks with WGCNA (6) in the aforementioned study may have removed meaningful connections in an attempt to reduce noise, and also has difficulties incorporating heterogeneous data. RNAseq and methylation data from lung epithelial cells in subjects with and without asthma will be analyzed to determine master regulator genes that initiate the Type II inflammatory response in lung epithelial cells. To begin our analysis, RNAseq data will be processed to remove the effect of confounding variables, and used to construct a co-expression network. Similarly, we will construct a co-expression network with differentially methylated CpGs (DMCs). Both these networks may give us insights into the genetic and epigenetic signatures that influence variation in asthma endotypes. We may try other analyses too, if we have time. These include determining whether methylation levels at DMCs are correlated with expression levels of nearby genes and integrating DMC and RNA-Seq data using a network-interaction based approach.

References

  1. Pawankar R. 2014. Allergic diseases and asthma: a global public health concern and a call to action. World Allergy Organ. J. 7: 12.
  2. Wesolowska-Andersen A, Seibold MA. Airway molecular endotypes of asthma: dissecting the heterogeneity. Curr Opin Allergy Clin Immunol. 2015;15(2):163–168. doi:
  3. Locksley RM. Asthma and allergic inflammation. Cell. 2010;140:777–783.
  4. Seumois, Grégory, et al. "Transcriptional profiling of Th2 cells identifies pathogenic features associated with asthma." The Journal of Immunology 197.2 (2016): 655-664
  5. Nicodemus-Johnson, Jessie et al. “DNA Methylation in Lung Cells Is Associated with Asthma Endotypes and Genetic Risk.” JCI Insight 1.20 (2016): e90151. PMC. Web. 26 Jan. 2017.
  6. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. doi:10.1186/1471-2105-9-559.
singha53-zz commented 7 years ago

some comments to get you started:

first statement is incorrect: Asthma is characterized by chronic inflammation, and affects over 400 million children and adults worldwide (1).

Furthermore, the generation of interactive networks with WGCNA (6) in the aforementioned study may have removed meaningful connections in an attempt to reduce noise, and also has difficulties incorporating heterogeneous data.

RNAseq and methylation data from lung epithelial cells in subjects with and without asthma will be analyzed to determine master regulator genes that initiate the Type II inflammatory response in lung epithelial cells.

To begin our analysis, RNAseq data will be processed to remove the effect of confounding variables, and used to construct a co-expression network. Similarly, we will construct a co-expression network with differentially methylated CpGs (DMCs). Both these networks may give us insights into the genetic and epigenetic signatures that influence variation in asthma endotypes.

These include determining whether methylation levels at DMCs are correlated with expression levels of nearby genes and integrating DMC and RNA-Seq data using a network-interaction based approach.

Remember to add a table stating the division of labour (e.g. which individuals are involved in the study design, obtaining data, preprocessing data, cleaning data, QC of data, exploratory data analysis, statistical analyses, writing etc).

Happy to discuss further at Wednesday's seminar. Cheers

@santina