LidaWangPSU / EXPRESSO

EXpression PREdiction with Summary Statistics Only
8 stars 1 forks source link

EXPRESSO

EXpression PREdiction with Summary Statistics Only paper link

Table of contents

Introduction

EXPRESSO (EXpression PREdiction with Summary Statistics Only) could bulid gene expression model with eQTL summary statistics and reference panel only. It also integrates 3D genomic data to define cis-regulatory regions properly and uses epigenetic annotation to prioritize causal variants. It is developed and maintained by Lida Wang at Dajiang Liu's Group.

Installation

The package is hosted on github, which allows installation and update to be very easy. First, make sure you have the MASS, data.table, BEDMatrix and caret packages installed.

install.packages("devtools")
library(devtools)

And also, you need the latest version of fast.lasso and rareGWAMA to be installed.

devtools::install_github("zhanxw/fast.lasso")
devtools::install_github("dajiangliu/rareGWAMA")

Then you could install EXPRESSO from the repository here.

devtools::install_github("LidaWangPSU/EXPRESSO/EXPRESSO")
library(EXPRESSO)

Here we go.

Quick tutorial

Bulid gene expression prediction model

res.tmp <- EXPRESSO(sumstatFile,annoFile,windowFile,refFile,out_path,minMaf,maxIter,gene.vec,append=F)

Input includes

Output results

We perform EXPRESSO by three different tunning parameter methods, including pseudo variable selection and MSE.

The weight output includes:

The cv output includes:

Usage

We provided example input data here.

Data were subsetted from GTEx whole blood tissue as an example to run the script.

Example of R script used to run EXPRESSO can be found here.

Example of EXPRESSO output can be found here.

Contact

Lida Wang lida.wang.96@gmail.com