EXpression PREdiction with Summary Statistics Only paper link
EXPRESSO (EXpression PREdiction with Summary Statistics Only) could bulid gene expression model with eQTL summary statistics and reference panel only. It also integrates 3D genomic data to define cis-regulatory regions properly and uses epigenetic annotation to prioritize causal variants. It is developed and maintained by Lida Wang at Dajiang Liu's Group.
The package is hosted on github, which allows installation and update to be very easy. First, make sure you have the MASS, data.table, BEDMatrix and caret packages installed.
install.packages("devtools")
library(devtools)
And also, you need the latest version of fast.lasso and rareGWAMA to be installed.
devtools::install_github("zhanxw/fast.lasso")
devtools::install_github("dajiangliu/rareGWAMA")
Then you could install EXPRESSO from the repository here.
devtools::install_github("LidaWangPSU/EXPRESSO/EXPRESSO")
library(EXPRESSO)
Here we go.
res.tmp <- EXPRESSO(sumstatFile,annoFile,windowFile,refFile,out_path,minMaf,maxIter,gene.vec,append=F)
Input includes
We perform EXPRESSO by three different tunning parameter methods, including pseudo variable selection and MSE.
The weight output includes:
The cv output includes:
We provided example input data here.
Data were subsetted from GTEx whole blood tissue as an example to run the script.
Example of R script used to run EXPRESSO can be found here.
Example of EXPRESSO output can be found here.
Lida Wang lida.wang.96@gmail.com