annahutch / corrcoverage

Repo for corrcoverage package
Other
6 stars 3 forks source link

corrcoverage

Coverage
status Build
Status

Webpage: https://annahutch.github.io/corrcoverage/

Manuscript: https://doi.org/10.1371/journal.pcbi.1007829

The corrcoverage R package uses a computationally efficient algorithm to find accurate coverage estimates of the causal variant in credible sets of prioritised variants from single causal variant genetic fine mapping (Maller et al., 2012, Wakefield, 2009.).

The package only requires GWAS summary statistics and can be used to:

We've strived to make our R package as easy to use as possible. Please see the flowchart below to decide which function is best to solve your problem. The interactive version (click-to-functions) is available here.


Installation

We recommend that all users download the package straight from github, as this contains a more complete version of the package:

install.packages("devtools") # if not already installed
devtools::install_github("annahutch/corrcoverage")

Alternatively (and if using a solaris operating system) download straight from cran using:

install.packages("corrcoverage")

Examples

For examples, please see the relevant vignettes.

  1. The Corrected Coverage vignette here should be read first. This shows readers how to use the corrcoverage R package to get an accurate coverage estimate of the causal variant in a credible set.

  2. The Corrected Credible Set vignette here follows on from the Corrected Coverage vignette and shows readers how the corrcoverage R package can be used to obtain a new credible set with the desired coverage of the causal variant.

  3. The Useful Info vignette here provides supplementary information about the usage of the package, including information about other useful functions.


In brief, the correction method involves simulating many credible sets from the same system as the original and calculating what proportion of these contain the true causal variant. Since the true causal variant is unknown, each variant is considered as causal in turn and the proportions are normalised by that variant’s posterior probability of causality.


Abstract

Genome Wide Association Studies (GWAS) have successfully identified thousands of loci associated with human diseases. Bayesian genetic fine-mapping studies aim to identify the specific causal variants within GWAS loci responsible for each association, reporting credible sets of plausible causal variants, which are interpreted as containing the causal variant with some "coverage probability". Here, we use simulations to demonstrate that the coverage probabilities are over-conservative in most fine-mapping situations. We show that this is because fine-mapping data sets are not randomly selected from amongst all causal variants, but from amongst causal variants with larger effect sizes. We present a method to re-estimate the coverage of credible sets using rapid simulations based on the observed, or estimated, SNP correlation structure, we call this the “adjusted coverage estimate”. This is extended to find “adjusted credible sets”, which are the smallest set of variants such that their adjusted coverage estimate meets the target coverage. We use our method to improve the resolution of a fine-mapping study of type 1 diabetes. We found that in 27 out of 39 associated genomic regions our method could reduce the number of potentially causal variants to consider for follow-up, and found that none of the 95\% or 99\% credible sets required the inclusion of more variants – a pattern matched in simulations of well powered GWAS. Crucially, our method requires only GWAS summary statistics and remains accurate when SNP correlations are estimated from a large reference panel. Using our method to improve the resolution of fine-mapping studies will enable more efficient expenditure of resources in the follow-up process of annotating the variants in the credible set to determine the implicated genes and pathways in human diseases.


Contact

If you have any questions or queries please contact me at anna.hutchinson@mrc-bsu.cam.ac.uk and I will be happy to assist.