Closed. parul-sharma closed this issue 4 years ago.
@parul-sharma Thank you for reaching out. One of the VCF files is not in the expected format (it is missing the position number), which causes an exception in the PyVCF library. I can try to detect this condition and work around the problem in a future release. For now, I suggest we find and examine the malformed VCF file, and then we can decide how to proceed. Would it be possible for you to make the VCF files available to me so I can diagnose the problem further? You can zip the files with the command below.
zip vcf.zip samples/*/var.flt.vcf
Hi Dr. Steven, thanks for your quick reply. Here are the requested VCF files. vcf.zip
There was a Java memory allocation error. See samples/UW492/var.flt.vcf. Delete the file samples/UW492/var.flt.vcf and try rerunning the pipeline.
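If more than one sample is affected, it may be quicker to check every file in one pass instead of one rerun at a time. Here is a minimal sketch, not part of snp-pipeline, that flags data lines which do not look like VCF records. It assumes the samples/*/var.flt.vcf layout used above, and it assumes a corrupted file shows up as a non-header line with fewer than 8 tab-separated columns or a non-integer POS field (for example, a Java error message written into the file). The function names are hypothetical.

```python
import glob
import sys


def find_malformed_lines(path):
    """Return (line_number, reason) tuples for data lines that do not look like VCF records."""
    problems = []
    with open(path, errors="replace") as handle:
        for number, line in enumerate(handle, start=1):
            line = line.rstrip("\n")
            # Skip blank lines and header/meta lines, which start with '#'.
            if not line or line.startswith("#"):
                continue
            fields = line.split("\t")
            if len(fields) < 8:
                # A VCF record has at least 8 fixed columns (CHROM .. INFO).
                problems.append((number, "fewer than 8 tab-separated columns"))
            elif not fields[1].isdigit():
                # The second column is POS and must be an integer.
                problems.append((number, "POS is not an integer: %r" % fields[1]))
    return problems


def scan(pattern="samples/*/var.flt.vcf"):
    """Map each suspicious file matching the glob pattern to its list of problems."""
    bad = {}
    for path in sorted(glob.glob(pattern)):
        problems = find_malformed_lines(path)
        if problems:
            bad[path] = problems
    return bad


if __name__ == "__main__":
    pattern = sys.argv[1] if len(sys.argv) > 1 else "samples/*/var.flt.vcf"
    for path, problems in scan(pattern).items():
        number, reason = problems[0]
        print("%s: line %d: %s" % (path, number, reason))
```

Run it from the working directory that contains the samples folder. Any file it reports is a candidate for deletion before rerunning, but it is worth opening the file first to confirm what went wrong.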
Thanks for finding the issue. I deleted that file and reran the pipeline. It still gave me the same error, but with a different file this time. I also tried running it on a cluster with significantly more memory, but at least 8 of my samples still end up with corrupted VCF files due to this Java memory allocation error. Is there a way to work around this problem? I do understand that this is a little out of scope for your software, and I really appreciate you taking the time to help me.
It sounds like concurrent processes are competing for available memory. You can make some adjustments to the configuration file to reduce the number of concurrent processes. See these links for the documentation: https://snp-pipeline.readthedocs.io/en/latest/faq.html#performance https://snp-pipeline.readthedocs.io/en/latest/configuration.html
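As a rough way to choose a value for the concurrency limit, the arithmetic can be sketched like this. This is only a back-of-the-envelope estimate under stated assumptions: each concurrent process may use up to a fixed Java heap, some memory is held back for the operating system, and the function and parameter names are hypothetical rather than snp-pipeline settings. The actual configuration parameters are described in the documentation linked above.

```python
def max_concurrent_jobs(total_mem_gb, heap_per_job_gb, reserve_gb=2.0, cpu_cores=None):
    """Estimate how many memory-hungry processes can safely run at once.

    total_mem_gb    -- memory available on the machine or in the cluster job
    heap_per_job_gb -- assumed maximum Java heap for each process
    reserve_gb      -- assumed memory left for the OS and other tools
    cpu_cores       -- optional cap so the result never exceeds the core count
    """
    if heap_per_job_gb <= 0:
        raise ValueError("heap_per_job_gb must be positive")
    usable = total_mem_gb - reserve_gb
    # How many full heaps fit into the usable memory.
    by_memory = int(usable // heap_per_job_gb)
    # Always allow at least one process, even if memory is tight.
    limit = max(by_memory, 1)
    if cpu_cores is not None:
        limit = min(limit, max(cpu_cores, 1))
    return limit


if __name__ == "__main__":
    # Example: 64 GB of memory with a 15 GB heap per process.
    print(max_concurrent_jobs(64, 15))
```

If the estimate comes out lower than the number of processes the pipeline starts by default, that mismatch would be consistent with the memory allocation errors seen here.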
Changing the parameters in the configuration file totally worked! Thanks for suggesting that.
Hello, I have tested the software on the Listeria genomes with the run option and it worked fine. I am now trying on a set of 50 genomes and the pipeline shuts down each time at the abnormal SNP filtration step with the following error:
I am using the latest version 2.2.0 of snp-pipeline and all other dependencies are the suggested versions as per the documentation.