Closed AkshajD closed 1 year ago
There are two solutions:
hs37d5
contig rather than using human_g1k_v37.fasta
bcftools filter
with bcftools filter -t ^hs37d5
to remove variants from the hs37d5
contigThe second error is just the result of the fact that the first command failed and the output VCF was not correctly generated
I have a single test VCF file from WGS that I am running through your data preparation pipeline. The original BAM was aligned on hs37d5 (an alternate form/version of GRCh37 from what I understand), and hence the VCF is also aligned to that version.
When running this first command with the VCF and the input files you indicated/provided for GRCh37 in the following command to "Create the minimal binary VCF":
gives the errors:
Subsequently, the following command to "generate list of variants to be excluded from modelling by both eagle and mocha":
gives the error:
[W::bcf_sr_add_reader] No BGZF EOF marker; file '/home/adarbar/projects/rrg-rauhm/adarbar/MoChA_test/MoChA_Run/MoChA_output/008ca9e31add6629e40396fdf930a2b7.MoChA.unphased.bcf' may be truncated
It seems that this also stops it from generating the index files, preventing Phasing from being able to run.
Can you shed some light on what the problem here might be, and what I can do to fix it?