bihealth / auto-acmg

A tool for automatic classification of sequence variants according to ACMG criteria.
GNU General Public License v3.0
1 stars 0 forks source link

Finish `AutoPP3BP4` #118

Closed gromdimon closed 2 months ago

gromdimon commented 2 months ago

Is your feature request related to a problem? Please describe. After implementaino of #77 we have AutoPP3BP4 class. Now we need to properly implement all the methods and test them

Describe the solution you'd like

Describe alternatives you've considered N/A

Additional context Information about PP3 and BP4

PP3 (in silico predictions)

Original Definition

Multiple lines of computational evidence support a deleterious effect on the gene or gene product (conservation, evolutionary, splicing impact, etc).

Caveats:

- As many in silico algorithms use the same or very similar input for their predictions, each algorithm should not be counted as an independent criterion.
- PP3 can be used only once in any evaluation of a variant.

-- Richards et al. (2015); Table 4

Preconditions / Precomputations

Implemented Criterion

An initial prediction is fist done using the general purpose pathogenicity predictors.

Then, for splicing the following is done.

The highest-scoring variant is used for the final prediction.

User Report

Caveats

Notes

BP4

BP4 (in silico predictions)

.. note::

- we have not implemented MitoTip or MitImpact yet
- we are lacking phylop scores yet
- we don't have live CADD scores yet

Original Definition

Multiple lines of computational evidence suggest no impact on gene or gene product (conservation, evolutionary, splicing impact, etc).

Caveat: As many in silico algorithms use the same or very similar input for their predictions, each algorithm cannot be counted as an independent criterion.
BP4 can be used only once in any evaluation of a variant.

-- Richards et al. (2015); Table 4

Preconditions / Precomputations

Implemented Criterion

See :ref:acmg_seqvars_criteria-pp3 for details.

User Report

See :ref:acmg_seqvars_criteria-pp3 for details.

Literature

See :ref:acmg_seqvars_criteria-pp3 for details.

Caveats

See :ref:acmg_seqvars_criteria-pp3 for details.

Notes

Intervar

Intervar

PP3 and BP4 by Automated Scoring When multiple pieces of computational evidence support a deleterious effect on the gene or gene product (conservation, evolutionary, splicing impact, etc.), then the supporting pathogenic evidence of PP3 will be assigned as 1. In comparison, when multiple pieces of computational evidence suggest no impact on the gene or gene product, then supporting benign evidence of BP4 will be assigned as 1. All sets of in silico results must agree when PP3 or BP4 is assigned. These multiple pieces of computational evidence can be provided by ANNOVAR from the “dbnsfp30a” database, where the MetaSVM score16 is used for deleteriousness prediction and GERP++ is used for evolutionary conservation. The splicing impacts can be inferred by ANNOVAR from the “dbscsnv11” database. For the evidence of PP3 and BP4, we set the cutoff to 0.0 for MetaSVM scores (greater scores indicate more likely deleterious effects), 2.0 for GERP++_RS (smaller scores indicate less conservation), and 0.6 for adaptive boosting (ADA) and random forest (RF) scores of dbscSNV as splicing impact (larger scores indicate more likely splice altering).

gromdimon commented 2 months ago

Image with scores: https://www.ncbi.nlm.nih.gov/pmc/articles/instance/9748256/bin/gr4.jpg