Closed stephenwilliams22 closed 6 years ago
He @stephenwilliams22 - I not that the sqlachemy error is database or disk is full
: are you sure this isn't your problem?
Thanks for the response Aaron. My disc definitely isn't full (2TB free) and I'm using --passonly
to limit the number of variants. I have run this exact sample with using freebayes (VCF 4.1) and gemini worked fine. When I switched to GATK (VCF 4.2) everything blew up.
Here's my exact script to load the gemini db
gemini load -v my.VEP.vcf \
-t VEP \
--cores 20 \
--skip-gene-tables \
--passonly \
vep.hg19.db
I think your tempdir is full. Try setting --tempdir
to the same path that my.VEP.vcf is writing too.
Thanks Aaron, This seems to have gotten me over the first hump. Now I seem to have a new error. I recently upgraded using conda to 0.20.1 and am getting this error when trying to load the db.
Traceback (most recent call last):
File "/mnt/home/stephen/miniconda2/envs/gemini_env/bin/gemini", line 7, in <module>
gemini_main.main()
File "/mnt/home/stephen/miniconda2/envs/gemini_env/lib/python2.7/site-packages/gemini/gemini_main.py", line 1248, in main
args.func(parser, args)
File "/mnt/home/stephen/miniconda2/envs/gemini_env/lib/python2.7/site-packages/gemini/gemini_main.py", line 204, in load_fn
gemini_load.load(parser, args)
File "/mnt/home/stephen/miniconda2/envs/gemini_env/lib/python2.7/site-packages/gemini/gemini_load.py", line 23, in load
annos = annotations.get_anno_files(args)
File "/mnt/home/stephen/miniconda2/envs/gemini_env/lib/python2.7/site-packages/gemini/annotations.py", line 22, in get_anno_files
anno_dirname = config["annotation_dir"]
KeyError: 'annotation_dir'
Looks like a fresh gemini install may have cured what ails me. That being said, do you have any suggestions on things to look out for with vcf 4.2?
To be honest, I am not aware of any issues with 4.2. Have you seen any so far?
I'm doing some comparisons now but looks okay at first glance. I'm going to close this issue. Thanks Aaron!
I'm doing some comparisons now but looks okay at first glance. I'm going to close this issue. Thanks Aaron!
Hello! Did you notice any differences, or loss of information?
Hi Aaron and Brent, On the the gemini main page you sate "GEMINI is very strict about adherence to VCF format 4.1." However, with the recent update to GATK4 the default, and unmodifiable, output of HaplotypeCaller is VCF 4.2. Do you all have recommendations for working with VCF 4.2? This is causing me a ton of trouble right now.
Find my error output below. I have 2TB of disc space available so I think that this much have to do with the second error "IOError: /dev/stdin if not valid bcf or vcf".
Any help would be greatly appreciated!