What is the motivation / use case for changing the behavior?
The feature will allow PharmCAT users to use a compressed reference human genome sequence for VCF normalization. It provides computer storage friendly files, reduces download time and improves preprocessing speed for PharmCAT users.
feature for preprocessor
Preprocessor downloads and decompresses the gzip compressed fasta file of reference human genome from NCBI website.
Retrieve the bgzip compressed fasta file and its index files from https://github.com/PharmGKB/PharmCAT-data instead.
The feature will allow PharmCAT users to use a compressed reference human genome sequence for VCF normalization. It provides computer storage friendly files, reduces download time and improves preprocessing speed for PharmCAT users.