Questions on data bundle and SV genotyping

bioinformatics-centre / BayesTyper

A method for variant graph genotyping based on exact alignment of k-mers

86 stars 7 forks source link

Hi @jonassibbesen ,

Thanks for providing this great tool.

I am now trying to download the data bundle from the link you provide. However, it always failed after ~1GB data is downloaded no matter which tool I use. For example, wget keep raising error Connection closed at byte 1073725440. Retrying..

Will you consider provide an alternative link for download? Or could you clarify if I generate the reference data for GRCh38 in this way is OK:

Reference genome: put chr1-22, X, Y, chrrandom to canon.fa, put chrUn, chrdecoy to decoy.fa, skip chralt and HLA
Variant prior vcf: sequence resolved site-only vcf

By the way, if I only want to genotype large SVs detected from long read sequencing-based callset, can I skip the variant calling step and estimate SV genotype of short read sequencing samples using SV callset + SNV/INDEL prior file?

Thanks, Han

bioinformatics-centre / BayesTyper

Questions on data bundle and SV genotyping #46