moahaegglund closed this 3 years ago
Hi! Thank you! I think we need some more context to help with this one. The first two points are partially pipeline-specific. Which pipeline and version produced the files you are loading? If this is in Solna, a customer number and case would also be useful so we can look at everything.
Not cutting the gene list for large SVs is tricky, as some may overlap thousands of genes. What we could revisit is showing all genes that appear in the default panel, or something similar.
For the final point, there are really two questions: docs, and SV overlap/clustering in the frequency filter:
We can try to document a bit on the Scout side, but the filtering is done using annotations from the pipeline. This is therefore mostly a matter of interaction with the pipeline: propagating what is in the pipeline's reference set to the end user, perhaps with a comment in a fixed field in the VCF header?
The local frequency display is based on loqusdb clustering, so it allows an imprecision in breakpoint matching that is proportional to variant size. The global frequencies displayed for individual variants from the VCF, e.g. SweGen and gnomAD, are again pipeline-dependent. If this is a pipeline that uses SVDB for SV annotation, as most of our in-house ones do, it uses another clustering method.
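To illustrate what "size-proportional imprecision" could mean in practice, here is a minimal sketch. It is not the actual loqusdb or SVDB implementation; the function name, the `fraction` and `min_slack` parameters, and their defaults are assumptions made up for this example.

```python
def breakpoints_match(pos_a, end_a, pos_b, end_b, fraction=0.1, min_slack=10):
    """Return True if two SVs on the same chromosome have both breakpoints
    within a tolerance that grows with the size of the larger variant.

    `fraction` and `min_slack` are hypothetical tuning parameters,
    not values taken from loqusdb or SVDB.
    """
    # Size of the larger of the two variants
    size = max(end_a - pos_a, end_b - pos_b)
    # Allowed distance per breakpoint: proportional to size, with a floor
    slack = max(min_slack, int(size * fraction))
    return abs(pos_a - pos_b) <= slack and abs(end_a - end_b) <= slack


# A 50 bp shift is tolerated for a ~1 kb variant but not for a 100 bp one
print(breakpoints_match(1000, 2000, 1050, 2080))
print(breakpoints_match(1000, 1100, 1050, 1150))
```

The point is only that two callers using different tolerance rules can disagree on whether the same pair of variants is "the same", which is why local and global frequencies may not line up.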
Some good points you got there Daniel 💯
Super! I didn't find any somaticscore on recent balsamic runs, so I'll transfer this issue over to Balsamic for now, and open a new one for Scout regarding the display of gene symbols for large structural variants. We could perhaps do a searchable list or just show the ones from given panels.
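As a rough sketch of the "just show the ones from given panels" idea: the helper below is hypothetical and not existing Scout code, and `limit` is an assumed display cap. It keeps only gene symbols that are in the panel and reports how many were left out, so the view could show something like "and N more".

```python
def genes_to_display(variant_genes, panel_genes, limit=20):
    """Pick the gene symbols worth showing for a large SV.

    Hypothetical helper: returns (symbols to show, number of hidden genes).
    Only genes present in `panel_genes` are shown, capped at `limit`.
    """
    panel = set(panel_genes)
    unique_genes = set(variant_genes)
    # Sort for a stable display order
    in_panel = sorted(gene for gene in unique_genes if gene in panel)
    shown = in_panel[:limit]
    hidden = len(unique_genes) - len(shown)
    return shown, hidden


shown, hidden = genes_to_display(
    ["BRCA2", "TP53", "FOO1", "TP53"],  # genes overlapping the SV (FOO1 is made up)
    ["TP53", "BRCA2", "ATM"],           # genes in the selected panel
)
print(shown, hidden)
```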
Manta's tumor-normal analysis has SOMATICSCORE in the header; tumor-only mode is missing this value. Since this is a Manta and BALSAMIC issue, I'm closing this.
Scout should parse INFO/SOMATICSCORE if it exists and show it if needed
##INFO=<ID=SOMATIC,Number=0,Type=Flag,Description="Somatic mutation">
##INFO=<ID=SOMATICSCORE,Number=1,Type=Integer,Description="Somatic variant quality score">
##INFO=<ID=SVINSLEN,Number=.,Type=Integer,Description="Length of insertion">
##INFO=<ID=SVINSSEQ,Number=.,Type=String,Description="Sequence of insertion">
##INFO=<ID=SVLEN,Number=.,Type=Integer,Description="Difference in length between REF and ALT alleles">
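For reference, a minimal sketch of reading an optional SOMATICSCORE from the INFO column of a VCF record using plain string handling. This is not how Scout actually parses variants; `parse_info` and `get_somatic_score` are hypothetical names. Per the header above, SOMATICSCORE is a single integer and SOMATIC is a flag, and tumor-only records are expected to lack the score, so a missing key returns `None`.

```python
def parse_info(info_column):
    """Split a VCF INFO column into a dict; flags (no '=') map to True."""
    info = {}
    if info_column in ("", "."):
        return info
    for entry in info_column.split(";"):
        if not entry:
            continue
        key, sep, value = entry.partition("=")
        info[key] = value if sep else True
    return info


def get_somatic_score(info_column):
    """Return SOMATICSCORE as an int, or None if absent or malformed."""
    raw = parse_info(info_column).get("SOMATICSCORE")
    if raw is None or raw is True:
        return None
    try:
        return int(raw)
    except ValueError:
        return None


# Tumor-normal style record with a score, and one without
print(get_somatic_score("SVTYPE=DEL;SOMATIC;SOMATICSCORE=42;SVLEN=-500"))
print(get_somatic_score("SVTYPE=DEL;SVLEN=-500"))
```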
This is feedback from a customer with a wish about what to display on the first page for the SVs. There is also a question regarding filtering of SVs against databases.