exomiser / Exomiser

A Tool to Annotate and Prioritize Exome Variants
https://exomiser.readthedocs.io
GNU Affero General Public License v3.0
202 stars 55 forks source link

ClinPred #272

Closed damiansm closed 5 years ago

damiansm commented 6 years ago

When scores are publically available investigate performance - incorporates freq into score so woudl be a good replacement for us

One possibility would be to not incorporate freq into the default score calculations i.e. LOF that passes filters is always 1. For missense use ClinPred

julesjacobsen commented 6 years ago

ClinPred: Prediction Tool to Identify Disease-Relevant Nonsynonymous Single-Nucleotide Variants https://doi.org/10.1016/j.ajhg.2018.08.005

Missense only variants, similar to REVEL

In summary, we have developed an ensemble classifier for predicting disease relevance of missense SNVs, using a combination of two different machine learning algorithms and incorporating several popular pathogenicity predictors, along with population allele frequencies, as component features. Our classifier is specifically designed to predict pathogenicity of variants that are causative for Mendelian disease.

ClinPred data: https://sites.google.com/site/clinpred/

damiansm commented 5 years ago

The TEST pathogenicity scores should work now – I’ve added the latest build (exomiser-cli-12.0.0-SNAPSHOT) to /data/WHRI-Phenogenomics/projects/Jules/ along with a compatible ClinPred file.

To get it working you’ll need to:

  1. Have a bgzipped tabix-indexed file to hand in the format defined below.
  2. Add exomiser.hg19.test-pathogenicity-score-path=full/path/to/ClinPred.txt.gz to the application.properties file
  3. Add TEST to the pathogenicityScources: [] in the analysis.yml file

Then run as usual.

Test file format must be:

Currently the new scores can be seen in the HTML and JSON output.

damiansm commented 5 years ago

Tested now. See future ML issue for next steps