Closed daquang closed 6 years ago
Did you first run tests/data/download_gene_fasta.py
? If so could you please inspect the file to make sure it's not empty? I have a test for this, and it's passing currently so I suspect it's an issue with your chr22.fasta file.
Yes, you are correct. The chr22.fasta file was empty. I manually downloaded the human_b36_male.fa from 1000 genomes and tried it again, and it worked. Perhaps chr22.fasta is not being made correctly because I'm using python 3.6 or I am missing some dependencies?
Also, perhaps you can make FastaVariant a little faster by using cyvcf2 instead of PyVCF?
That’s a great idea. Maybe @brentp can make a suggestion about integration. I’ll look into it.
That would be great if we can we get this going! I have to grab lots of sequences efficiently with SNPs properly replacing single letters, and my VCF files may have millions of variants. Every small speedup helps.
Do you think you can also provide some documentation on what some of the arguments of FastaVariant (e.g. hom, het, call_filter) mean?
Sorry for the delay here. I'm planning to re-write the documentation when I have time and will certainly add more clarification about the variant filters.
I'm trying to run the FastaVariant example in the README, but I'm getting the following error: