Closed komalsrathi closed 8 years ago
Your file name is sample.snpeff.vcf.bgz
; please rename to sample.snpeff.vcf.gz
and let us
know if the problem persists.
Hi,
My file was already bgzipped. I tried the same command on
All three give me the following error:
Traceback (most recent call last):
File "/home/rathik/tools/bin/gemini", line 6, in <module>
gemini.gemini_main.main()
File "/home/rathik/data/anaconda/lib/python2.7/site-packages/gemini/gemini_main.py", line 1185, in main
args.func(parser, args)
File "/home/rathik/data/anaconda/lib/python2.7/site-packages/gemini/gemini_main.py", line 198, in load_fn
gemini_load.load(parser, args)
File "/home/rathik/data/anaconda/lib/python2.7/site-packages/gemini/gemini_load.py", line 24, in load
annos = annotations.get_anno_files(args)
File "/home/rathik/data/anaconda/lib/python2.7/site-packages/gemini/annotations.py", line 17, in get_anno_files
anno_dirname = config["annotation_dir"]
TypeError: 'NoneType' object has no attribute '__getitem__'
you'll need to bgzip the file. not gzip. can you show the full commands that you are running along with the full output.
Thank you. These are my commands:
snpEff -Xmx32g -Xms16g -Djava.io.tmpdir=/users/rathik/scratch -c snpEff.GRCh38.config -ud 10 -classic GRCh38.82 sample.vcf > sample.snpeff.vcf
bgzip -c sample.snpeff.vcf > sample.snpeff.vcf.gz
tabix -p vcf sample.snpeff.vcf.gz
gemini load --cores 3 -t snpEff -v sample.snpeff.vcf.gz -p samples.ped sample.gemini.db
However, no output files are created and I am still getting the following error:
Traceback (most recent call last):
File "/home/rathik/tools/bin/gemini", line 6, in <module>
gemini.gemini_main.main()
File "/home/rathik/data/anaconda/lib/python2.7/site-packages/gemini/gemini_main.py", line 1185, in main
args.func(parser, args)
File "/home/rathik/data/anaconda/lib/python2.7/site-packages/gemini/gemini_main.py", line 198, in load_fn
gemini_load.load(parser, args)
File "/home/rathik/data/anaconda/lib/python2.7/site-packages/gemini/gemini_load.py", line 24, in load
annos = annotations.get_anno_files(args)
File "/home/rathik/data/anaconda/lib/python2.7/site-packages/gemini/annotations.py", line 17, in get_anno_files
anno_dirname = config["annotation_dir"]
TypeError: 'NoneType' object has no attribute '__getitem__'
I would again like to reinstate that gemini_data has hg19 specific whereas my reference is hg38.
If you are trying to use gemini on genome other than hg19, you'll need to follow the example here: http://quinlanlab.org/blog/2016/05/02/gemini-2-progress.html
including setting up your own annotations.
I missed a very important part while reading the documentation. Thank you for your prompt responses. I will close this issue for now.
All that is required is the researcher collect the annotation files relevant to the species and build of interest and create a vcfanno configuration file that dictates the exact annotation files and attributes that are desired.
Hello,
I installed gemini v0.18.3 as follows:
This is giving me the following error:
Note I checked the data folder /home/rathik/data/gemini_data/ and by default, gemini loads hg19/GRCh37 databases whereas for my samples I am using hg38/GRCh38 as reference. Could that be the problem? If yes, how could one load all hg38 databases during installation?
More details To perform tests, I followed the steps below:
I got the same error (line 85 in test_log.txt), the tests went past it but eventually failed. I attached the test.out and test.log file.
test_out.txt test_log.txt
Thanks, Komal