Closed damiansm closed 5 years ago
ClinPred: Prediction Tool to Identify Disease-Relevant Nonsynonymous Single-Nucleotide Variants https://doi.org/10.1016/j.ajhg.2018.08.005
Missense only variants, similar to REVEL
In summary, we have developed an ensemble classifier for predicting disease relevance of missense SNVs, using a combination of two different machine learning algorithms and incorporating several popular pathogenicity predictors, along with population allele frequencies, as component features. Our classifier is specifically designed to predict pathogenicity of variants that are causative for Mendelian disease.
ClinPred data: https://sites.google.com/site/clinpred/
The TEST pathogenicity scores should work now – I’ve added the latest build (exomiser-cli-12.0.0-SNAPSHOT) to /data/WHRI-Phenogenomics/projects/Jules/ along with a compatible ClinPred file.
To get it working you’ll need to:
Then run as usual.
Test file format must be:
Currently the new scores can be seen in the HTML and JSON output.
Tested now. See future ML issue for next steps
When scores are publically available investigate performance - incorporates freq into score so woudl be a good replacement for us
One possibility would be to not incorporate freq into the default score calculations i.e. LOF that passes filters is always 1. For missense use ClinPred