Thanks for the report and sorry about the issue. Could you try updating to the latest bcbio development version (upgrade -u development) and re-running? We're working to move to Python 3, and the latest development version has some fixes that should help with finding these programs nested inside the python2 conda environment. Please let us know if there are still issues after updating; we're trying to get as much testing as possible with the changes before the next release.
Looks like strelka2 was missing as well; I never noticed it until you pointed out the migration to Python 3. I ran the upgrade command, but couldn't find the missing tool binaries (strelka2, macs2, platypus etc.) in my $PATH after it finished. Then I added the --tools option to the upgrade command. It mentioned that the tools were already installed:
I've just thought of something that might be worth looking into.
I made a parallel install of bcbio_nextgen on a cluster around the same time that I created the problematic install that I mentioned here (which is on a cloud VM). I've just checked the cluster install and the tools are all linked inside the bcbio_nextgen/tools/bin directory, including strelka2 and platypus et al.
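A check like the one described above (confirming that the expected tools are linked inside the tools/bin directory) can be sketched in a few lines of Python. This is only an illustration, not part of bcbio: the function name missing_tools, the install location and the executable names are all assumptions and need to be adjusted to the real install.

```python
import os


def missing_tools(bin_dir, names):
    """Return the names that have no usable executable entry in bin_dir."""
    missing = []
    for name in names:
        candidate = os.path.join(bin_dir, name)
        # isfile() and access() both follow symlinks, so a dangling
        # link is reported as missing, just like an absent file.
        if not (os.path.isfile(candidate) and os.access(candidate, os.X_OK)):
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Hypothetical location and executable names (assumptions, not
    # taken from bcbio itself); adjust them to the actual install.
    bin_dir = os.path.expanduser("~/bcbio_nextgen/tools/bin")
    print(missing_tools(bin_dir, ["macs2", "platypus"]))
```

Running it against both the cluster and the VM install would show which links exist on one side but not the other.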
When setting up the VM paths, I created a list of directories and a symlink in order to "mirror" the directory structure from the cluster, so that commands and scripts would work seamlessly on both.
My home directory is /export/home/ncit/external/a.mizeranschi. I had an "/export" mount point available on the VM and I had already copied a bunch of stuff there before getting access to the cluster, so I thought I'd create the /export/home/ncit/external/ directories and make a symlink there called a.mizeranschi, which points to the parent directory /export, such that the paths /export/<stuff> from the VM and /export/home/ncit/external/a.mizeranschi/<stuff> from the cluster are analogous.
Could this setup with a symlink to a parent directory cause the problems I've been seeing? I haven't noticed any other strange things besides the issues with the tooldir in bcbio.
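To make the setup concrete, here is a minimal sketch that recreates a symlink pointing back at an ancestor directory inside a scratch folder and compares the path as written with the path after symlinks are resolved. It does not reproduce bcbio's install logic; the helper names and the link name "user" are hypothetical, and it assumes a POSIX system where creating symlinks is allowed.

```python
import os
import tempfile


def describe_path(path):
    """Return the path as written and with all symlinks resolved."""
    return {
        "as_written": os.path.abspath(path),
        "resolved": os.path.realpath(path),
    }


def build_demo_layout(root):
    """Recreate the described layout under root (names are hypothetical)."""
    export = os.path.join(root, "export")
    external = os.path.join(export, "home", "ncit", "external")
    os.makedirs(external)
    link = os.path.join(external, "user")
    os.symlink(export, link)  # symlink pointing back at an ancestor directory
    os.makedirs(os.path.join(export, "bcbio_nextgen", "tools"))
    return export, link


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as root:
        export, link = build_demo_layout(root)
        via_link = os.path.join(link, "bcbio_nextgen", "tools")
        info = describe_path(via_link)
        print(info["as_written"])  # long path that goes through the symlink
        print(info["resolved"])    # short path directly under export
```

The same directory ends up with two different spellings, so any code that records one form and later compares it against the other can disagree about where the tool directory is.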
I've created a new bcbio install on the VM (where the symlink setup is) and noticed some errors during the install:
Installing bcbio-nextgen
Installing data and third party dependencies
Traceback (most recent call last):
File "", line 287, in <module>
main(parser.parse_args(), sys.argv[1:])
File "", line 44, in main
subprocess.check_call([bcbio, "upgrade"] + _clean_args(sys_argv, args))
File "/usr/lib64/python2.7/", line 542, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command '['/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda/bin/', 'upgrade', '--tooldir=/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/tools', '--genomes', 'sacCer3', '--datatarget', 'variation', '--datatarget', 'rnaseq', '--datatarget', 'smallrna', '--aligners', 'bwa', '--aligners', 'bowtie2', '--aligners', 'hisat2', '--aligners', 'star', '--isolate', '--cores', '10', '--data']' returned non-zero exit status 1
-- Upgrading with the latest development code and installing CWL support
Upgrading bcbio
Traceback (most recent call last):
File "/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/tools/bin/", line 221, in <module>
File "/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda/lib/python2.7/site-packages/bcbio/", line 56, in upgrade_bcbio
args = add_install_defaults(args)
File "/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda/lib/python2.7/site-packages/bcbio/", line 706, in add_install_defaults
raise ValueError("Default tool directory not yet saved in config defaults. "
ValueError: Default tool directory not yet saved in config defaults. Specify the '--tooldir=/path/to/tools' to upgrade tools. After a successful upgrade, the '--tools' parameter will work for future upgrades.
I ended up deleting the symlink, creating a directory instead and moving all the stuff there, as I probably should have done in the first place. After reinstalling bcbio_nextgen (with the same commands and paths), it finished alright and the tools are all linked inside bcbio_nextgen/tools/bin.
Sorry for wasting your time with this.
Thanks for following up on this. Apologies, some deeper symlinking approaches might break the logic, and it looks like we didn't have a tooldir in this specific case so it failed and never ended up fixing the symlinks.
I appreciate you working around the problem for your case and for the detailed description in case anyone else hits the same issue. Thanks again for all the work debugging this.
I'm getting several errors (below) when running joint VC with Strelka2 in bcbio-nextgen v.1.1.3 (stable). HaplotypeCaller works fine on my input data.
It also looks like Platypus isn't being found (if I try using it instead of HaplotypeCaller or Strelka2) and, indeed, it does look like it's missing from my bcbio installation, although I remember using it in the past. Has it been left out, or is my bcbio installation incomplete?
Here is the YAML config file that I'm using:
And this is the outcome: