wilcas / vaecit

Variational autoencoders to use latent factors in causal inference
Apache License 2.0
0 stars 0 forks source link

Process Methylation Data to Individual Objects #9

Open wilcas opened 5 years ago

wilcas commented 5 years ago

In selection of other probes, there remains the issue of how to deal with data in expanding the number of mediation events detected. Ideally, I'd be using one nice LV method to reduce data down to 1 dimension, but it seems from simulation that this fails to capture the associations between that data and a target (i.e. more LVs are necessary).

I have a few suggestions as to how to continue/what to do to try to deal with this:

In each of the above cases I need to decide:

  1. How to group every feature by gene (fixed windows around TSS should be good enough)
  2. Whether or not it's valuable to pre-select sets of SNPs to group and re-do transcriptome/genome wide QT-LV study
  3. How I justify how the method accounts for complex non-linear interaction between omics data

    EDIT

    For now I'm going to group probes/peaks based on the gene they associate with within a window. Then I am going to take these top (bonferroni corrected threshold) QTMs/eQTAs and summarize them by LV.

To do this I will need to:

wilcas commented 5 years ago

Considerations

For now I am going to test probes/peaks and their association to a given gene's expression whose TSS is within 1 MB of that probe/peak.