I am trying to export a VCF from an annotated single-sample file. The VCF contains fewer entries than the GUI shows (the Excel and TSV exports are correct).
This happens whether the reports are generated from the GUI or from the command line.
No filtering was applied at this point.
The objective is to extract certain variants from priority genes to pass the vcf to a different annotator, which imposes data size limitations.
Procedure was:
Annotate a WGS or WES sample with many annotators, including clinvar and gnomad3, on Windows (oc v. 2.8).
Shrink the sqlite with a filter (clinvar: no benign; gnomad < 0.01; set of priority genes) from the command line using oc util.
Reimport smaller sqlite in oc gui and verify number of variants. Also, open smaller sqlite in DB Browser (a viewer for sqlite) and verify selected variants. All OK so far.
Export VCF, TSV and Excel from the GUI and from the command line. Both procedures give the same result: the VCF is missing about 20% of the variants present in the smaller sqlite and in the TSV and Excel exports. I can see no pattern in what is excluded: low and high quality, SNPs and indels, all chromosomes, the same genes. Variants appear to be kept or dropped at random.
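For anyone who wants to narrow down the same kind of discrepancy, here is a minimal sketch of comparing per-chromosome counts between the filtered sqlite and the exported VCF. It is not an OpenCRAVAT tool. The table name `variant` and column name `base__chrom` are assumptions about the sqlite layout, so pass the names your file actually uses. It also assumes an uncompressed VCF, and by default counts a line with several comma-separated ALT alleles once per allele.

```python
import sqlite3
from collections import Counter


def db_counts(db_path, table="variant", chrom_col="base__chrom"):
    """Count rows per chromosome in one table of the sqlite file.

    The default table and column names are assumptions; adjust as needed.
    """
    conn = sqlite3.connect(db_path)
    try:
        query = f'SELECT "{chrom_col}", COUNT(*) FROM "{table}" GROUP BY "{chrom_col}"'
        return Counter(dict(conn.execute(query).fetchall()))
    finally:
        conn.close()


def vcf_counts(vcf_path, per_allele=True):
    """Count variants per chromosome in an uncompressed VCF.

    With per_allele=True, a line whose ALT field lists several
    comma-separated alleles counts once per allele.
    """
    counts = Counter()
    with open(vcf_path, encoding="utf-8") as handle:
        for line in handle:
            # Skip header lines and blank lines.
            if line.startswith("#") or not line.strip():
                continue
            fields = line.rstrip("\n").split("\t")
            n = len(fields[4].split(",")) if per_allele and len(fields) > 4 else 1
            counts[fields[0]] += n
    return counts


def count_gaps(in_db, in_vcf):
    """Return {chrom: (db_count, vcf_count)} where the two counts differ."""
    chroms = sorted(set(in_db) | set(in_vcf), key=str)
    return {c: (in_db[c], in_vcf[c]) for c in chroms if in_db[c] != in_vcf[c]}
```

Calling `count_gaps(db_counts("filtered.sqlite"), vcf_counts("export.vcf"))` with your own paths (both file names here are placeholders) lists the chromosomes where the totals disagree. Running it again with `per_allele=False` shows whether merged multi-allelic lines account for any of the difference.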
Is there any reason that exporting in vcf format may filter or miss certain variants? Perhaps I did something wrong.
I would like to obtain the same number of variants as in the smaller filtered sqlite file. Ideally I would also like to remove all previous annotators to further shrink the data file.
Thanks
Hi @clinicalngs, our IT team could not reproduce your issue. Could you share your VCF file with us by email (support@opencravat.org) so we can troubleshoot it further?