exomiser / Exomiser

A Tool to Annotate and Prioritize Exome Variants
https://exomiser.readthedocs.io
GNU Affero General Public License v3.0
202 stars 55 forks source link

CADD score normalization #566

Open icooperstein opened 2 months ago

icooperstein commented 2 months ago

I am wondering if you can explain/look into how CADD scores are normalized to have a scale from 0-1 when used as a pathogenicity predictor. I have noticed that when I combine CADD with other tools, CADD is the max path score for almost every variant, regardless of variant type. I've also noticed when I inspect the output files, these CADD scores are often >0.97.

One example: One variant in my result had the following scores: CADD=0.97435516,REVEL=0.031,MVP=0.19391714,ALPHA_MISSENSE=0.0944 I looked up this variant in CADD, and its CADD Phred score is 15.

However, another variant had a Exomiser CADD=0.99748814 in the output, but a CADD Phred score of 26. Since Phred scores are logarithmic, the difference between 15 and 26 is much more drastic than I am seeing with these scaled scores in the output.

We would really like to use CADD scores, especially since many other predictors are for missense variants only.

julesjacobsen commented 1 month ago

The CADD normalisation is here:

https://github.com/exomiser/Exomiser/blob/7a4dc1e06bf489f7815a873ef6c34a83c3b2e99b/exomiser-core/src/main/java/org/monarchinitiative/exomiser/core/model/pathogenicity/CaddScore.java#L30-L43

AlistairNWard commented 1 month ago

Thanks for posting this @julesjacobsen. This makes some sense, but I'm not sure that it results in the desired behaviour. The CADD scaling will need to, as far as is reasonable, put the CADD scores on the same scale as the other pathogenicity scores. For example, a 0.9 for CADD should be largely equivalent to a 0.9 for REVEL. If this is not the case, then one pathogenicity source will outweigh all the others - which is what we see. If we include CADD, then it generally scores very high and will almost always be selected over other sources. I had a quick look at a number of variants with a REVEL score of ~0.9 and they all had corresponding CADD scores of ~30. By the scoring method above, a CADD score of 10 would be scaled to 0.9 and so it is not surprising that CADD heavily dominates. I think this scaling would benefit from a rethink. Is this something that we should discuss?