Clinical-Genomics / scout

VCF visualization interface
https://clinical-genomics.github.io/scout
BSD 3-Clause "New" or "Revised" License
150 stars 46 forks source link

Integration of chromosomal segments lists for annotation and filtering #1907

Open TJO-Lund opened 4 years ago

TJO-Lund commented 4 years ago

Integration of lists with information about chromosomal segments (including several genes e.g., ”chr22: 19,037,148 - 21,228,744”) for use in Scout in the same manner as we use gene lists. This is essential for better/correct evaluation of SV-variants such as microdeletion and microduplication syndromes. For example we would like to use ClinGen Regions (https://dosage.clinicalgenome.org/pathogenic_region.shtml) for both annotating purposes and filtering. It should also be easy to add similar, in house, segment lists to Scout. Can this be a prioritized issue? In communication with Daniel Nilsson; Tord Jonson Clinical Genomics Lund

dnil commented 4 years ago

Thanks! We have discussed this many times (and there are probably some issues somewhere relating to it in conjunction with a future panel revamp, where we would also like to support repeat expansions).

A more near term workaround would be to allow reading coordinate files and construct gene symbol lists from the genes overlapping them. This will have a slightly different end result, but variants without genes on them are not terribly likely to appear ranked to a level where the operator sees them anyway. Do you have any vital counterexamples where completely intergenic events are established disease causing?

TJO-Lund commented 4 years ago

One axample is this deletion "SHOX Deficiency Disorders, including Leri-Weill dyschondrostosis - critical regulatory element"* (chrX:819,816-867,357) - we have had one such finding with genomic array. Just settling with gene-lists is, IMO, simply suboptimal and I strongly believe that it will be neccessary to work with regions as well as with gene-lists, as our understanding of our genome evolves. Even today it is of paramount importance not to miss atypical deletions (when working with narrow gene lists, such as a cardio-gene list), which occur with many syndromes, for example 22q11 deletion syndrome with other breakpoints than the typical A-D variant. See (https://dosage.clinicalgenome.org/pathogenic_region.shtml).

*Extract from our local database Mimisbrunnr states "Region as defined in PMID: 22791839 "Identification of the first recurrent PAR1 deletion in Léri-Weill dyschondrosteosis and idiopathic short stature reveals the presence of a novel SHOX enhancer" and in PMID: 23636926 (2013) "Diagnostic screening identifies a wide range of mutations involving the SHOX gene, including a common 47.5 kb deletion 160 kb downstream with a variable phenotypic effect". See also Decipher https://decipher.sanger.ac.uk/syndrome/58#overview; OMIM #127300 (heterozygous loss), #249700 (homozygous loss, Langer mesomelic dysplasia) and 300582 (Short stature, idiopathic familial); ORPHA 240 (Léri-Weill dyschondrosteosis), 2632 (Langer mesomelic dysplasia) and 314795 (SHOX-related short stature); GeneReviews PMID: 20301394 "SHOX Deficiency Disorders"; and ClinGen Gene Curation: www.ncbi.nlm.nih.gov/projects/dbvar/clingen/clingen_gene.cgi?sym=SHOX&subject="

TJO-Lund commented 4 years ago

Just an idea from Hans, is it at all possible to include regions and STRs in a single list using "tags" for regions, genes or STRs? That would perhaps make it easier to apply a single list when filtering for different indications.

TJO-Lund commented 4 years ago

Does "Hard" mean that we should give up on this important enhancement or does it inspire someone to reach a difficult goal? :-)

dnil commented 4 years ago

You tell me! 😄

dnil commented 4 years ago

No, but seriously, we have not given it up. It will happen. If you'd like to contribute, that is very welcome, but otherwise it is also good also to know that you consider it important. That SHOX enhancer is a good example. We have some issues with it on the vcf2cytosure track as well - those don't look terribly clean in the PAR-regions - and quite potentially in calling as well.

TJO-Lund commented 4 years ago

Great! I'd love to help out any way I can, and you may include me if you set up a group that will work with this issue.

4WGH commented 1 year ago

We also need to have specific region in our panel, not always we can use a gene(s) with the risk for incidental findings in the SNV page. In panelapp there are ISCAs region with coodinates thast we would like to analyze in the SV view. an exemple https://panelapp.genomicsengland.co.uk/panels/509/region/ISCA-37393-Gain/#!details. could be good to ba able to use them :)

TJO-Lund commented 10 months ago

3 years+ for this issue, any recent updates/progress?

dnil commented 10 months ago

Oh, the question is much older. 😉 Suggestions are welcome!