Closed amizeranschi closed 5 years ago
Thanks for the report and sorry about the issue. Could you try updating to the latest bcbio development version:
bcbio_nextgen.py upgrade -u development
and re-running? We're working to move to Python 3 and the latest development version has some fixes which should help with finding these programs nested inside the python2 conda environment. Please let us know if there are still issues after updating, we're trying to get as much testing as possible with the changes before the next release.
Looks like strelka2 was missing as well, never noticed it until you pointed out the migration to Python3. I ran the upgrade command, but couldn't find the missing tool binaries (strelka2, macs2, platypus etc.) in my $PATH after it finished. Then, I added the --tools
option to the upgrade command. It mentioned that the tools were already installed:
# Installing into conda environment dv: deepvariant
Collecting package metadata: done
Solving environment: done
# All requested packages already installed.
# Installing into conda environment python2: cpat, dkfz-bias-filter, gemini, gvcf-regions, hap.py, lumpy-sv, macs2, manta, metasv, mirge, optitype, peddy, phylowgs, platypus-variant, sentieon, simple_sv_annotation, smcounter2, smoove, strelka, svtools, svtyper, theta2, tophat, vawk, vcf2db
Collecting package metadata: done
Solving environment: done
# All requested packages already installed.
# Installing into conda environment python3: atropos, crossmap
Collecting package metadata: done
Solving environment: done
# All requested packages already installed.
How are the different conda environments organized in the bcbio_nextgen directory structure and how will this affect the $PATH setup?
Indeed, I can see the respective package directories (e.g. platypus-variant-0.8.1.1-py27hdbffeaa_3 or macs2-2.1.2-py27r351_0) in bcbio_nextgen/anaconda/pkgs/
. However, the corresponding executables don't appear in bcbio_nextgen/anaconda/bin/
or bcbio_nextgen/tools/bin/
.
Actually, bcbio_nextgen/tools/bin/
only contains symlinks to the bcbio scripts: bcbio_fastq_umi_prep.py, bcbio_nextgen.py, bcbio_prepare_samples.py and bcbio_setup_genome.py.
Also, trying to manually run a binary directly from bcbio_nextgen/anaconda/pkgs/
results in an error. Is this to be expected?
/path_to/bcbio_nextgen/anaconda/pkgs/platypus-variant-0.8.1.1-py27hdbffeaa_3/bin/platypus
/path_to/bcbio_nextgen/anaconda/pkgs/platypus-variant-0.8.1.1-py27hdbffeaa_3/bin/platypus: line 17: /opt/anaconda1anaconda2anaconda3/bin/python: No such file or directory
/path_to/bcbio_nextgen/anaconda/pkgs/macs2-2.1.2-py27r351_0/bin/macs2
bash: /path_to/bcbio_nextgen/anaconda/pkgs/macs2-2.1.2-py27r351_0/bin/macs2: /opt/anaconda1anaconda2anaconda3/bin/python: bad interpreter: No such file or directory
Thanks for digging into this in depth and sorry for the continued problems. Since the packages you mention require python 2, they get installed into a separate conda environment: /path_to/bcbio_nextgen/anaconda/envs/python2/bin
rather than the main anaconda/bin
environment. However, they should get linked into the directory specified with --tooldir
, which is the high level PATH you should include with bcbio. There are no symlinks to tools in that directory? I'm confused, as that should happen for everything in the main anaconda environment as well as the sub environments. It is possible the upgrade didn't complete successfully, or something else happened to prevent that? Could you provide the full command you're using to upgrade? Is this with the latest development version or 1.1.3? Thanks again for the help debugging.
It's the latest development, I've run the upgrade a couple of times yesterday.
I found the tools in /path_to/bcbio_nextgen/anaconda/envs/python2/bin
and running e.g. macs2
from there seems to work. However, the tools aren't getting linked inside /path_to/bcbio_nextgen/tools/bin/
, which is what I had specified with --tooldir
when I first installed bcbio. The only stuff in tools/bin
are the four bcbio and bcbio_vm Python scripts.
I'm running upgrades like this: bcbio_nextgen.py upgrade -u development --data --tools --cwl
. I've just run such an upgrade now and here is its full output:
bcbio_nextgen.py upgrade -u development --data --tools --cwl
Upgrading bcbio
Collecting package metadata: ...working... done
Solving environment: ...working... done
# All requested packages already installed.
WARNING: The --force flag will be removed in a future conda release.
See 'conda install --help' for details about the --force-reinstall
and --clobber flags.
Collecting package metadata: done
Solving environment: done
## Package Plan ##
environment location: /export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda
added / updated specs:
- libgcc-ng
Preparing transaction: done
Verifying transaction: done
Executing transaction: done
WARNING: The --force flag will be removed in a future conda release.
See 'conda install --help' for details about the --force-reinstall
and --clobber flags.
Collecting package metadata: done
Solving environment: done
## Package Plan ##
environment location: /export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda
added / updated specs:
- libgcc-ng
Preparing transaction: done
Verifying transaction: done
Executing transaction: done
Upgrading bcbio-nextgen to latest development version
Collecting bcbio-nextgen from git+https://github.com/bcbio/bcbio-nextgen.git#egg=bcbio-nextgen
Cloning https://github.com/bcbio/bcbio-nextgen.git to /tmp/pip-install-I0df9d/bcbio-nextgen
Building wheels for collected packages: bcbio-nextgen
Running setup.py bdist_wheel for bcbio-nextgen ... done
Stored in directory: /tmp/pip-ephem-wheel-cache-gAiVRO/wheels/c0/af/8f/75201c9fda71b27e67f797ef794dbade910690b2e3fa36c16c
Successfully built bcbio-nextgen
Installing collected packages: bcbio-nextgen
Found existing installation: bcbio-nextgen 1.1.4a0
Uninstalling bcbio-nextgen-1.1.4a0:
Successfully uninstalled bcbio-nextgen-1.1.4a0
Successfully installed bcbio-nextgen-1.1.4a0
Upgrade of bcbio-nextgen development code complete.
## CWL support with bcbio-vm
Collecting package metadata: done
Solving environment: done
# All requested packages already installed.
Collecting package metadata: done
Solving environment: done
# All requested packages already installed.
Upgrading third party tools to latest versions
--2019-02-19 13:57:56-- https://github.com/chapmanb/cloudbiolinux/archive/master.tar.gz
Resolving github.com... 140.82.118.4, 140.82.118.3
Connecting to github.com|140.82.118.4|:443... connected.
HTTP request sent, awaiting response... 302 Found
Location: https://codeload.github.com/chapmanb/cloudbiolinux/tar.gz/master [following]
--2019-02-19 13:57:56-- https://codeload.github.com/chapmanb/cloudbiolinux/tar.gz/master
Resolving codeload.github.com... 192.30.253.120, 192.30.253.121
Connecting to codeload.github.com|192.30.253.120|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [application/x-gzip]
Saving to: ‘STDOUT’
0K ........ ........ ........ ........ ........ ........ 3.24M
3072K ........ ........ ........ ....... 13.7M=1.1s
2019-02-19 13:57:58 (4.62 MB/s) - written to stdout [5178650]
Setting up virtual machine
INFO: <cloudbio.flavor.Flavor instance at 0x7f3b12320b00>
INFO: This is a ngs_pipeline_minimal flavor
INFO: Reading default fabricrc.txt
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../config/fabricrc.txt
INFO: Distribution __auto__
INFO: Get local environment
INFO: CentOS setup
DBG [distribution.py]: NixPkgs: Ignored
[localhost] local: echo $HOME
[localhost] local: uname -m
INFO: Now, testing connection to host...
INFO: Connection to host appears to work!
DBG [utils.py]: Expand paths
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/main.yaml
INFO: Meta-package information from /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/main.yaml
- Packages: minimal,libraries,perl,python,java,r,ruby,bio_nextgen
- Libraries: r-libs
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/custom.yaml
Reading packages from /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/custom.yaml
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../config/chef_recipes.yaml
Reading packages from /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../config/chef_recipes.yaml
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../config/puppet_classes.yaml
Reading packages from /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../config/puppet_classes.yaml
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/main.yaml
INFO: Meta-package information from /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/main.yaml
- Packages: minimal,libraries,perl,python,java,r,ruby,bio_nextgen
- Libraries: r-libs
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/packages-homebrew.yaml
Reading packages from /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/packages-homebrew.yaml
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/main.yaml
INFO: Meta-package information from /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/main.yaml
- Packages: minimal,libraries,perl,python,java,r,ruby,bio_nextgen
- Libraries: r-libs
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/packages-conda.yaml
Reading packages from /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/packages-conda.yaml
Checking for problematic or migrated packages in default environment
# Installing into conda environment default: age-metasv, bamtools=2.4.0, bamutil, bbmap, bcbio-prioritize, bcbio-rnaseq, bcbio-variation, bcbio-variation-recall, bcftools, bedtools=2.27.1, bio-vcf, biobambam, bowtie, bowtie2, break-point-inspector, bwa, bwakit, cage, cancerit-allelecount, chipseq-greylist, cnvkit, coincbc, cramtools, cufflinks, cutadapt=1.16, cyvcf2, deeptools, delly, duphold>=0.0.9, ensembl-vep=95.*, express, extract-sv-reads, fastp, fastqc>=0.11.8=1, fgbio, freebayes=1.1.0.46, gatk, gatk4, geneimpacts, genesplicer, gffcompare, goleft, grabix, gridss, gsort, gvcfgenotyper, h5py, hisat2, hmftools-amber, hmftools-cobalt, hmftools-purple, hmmlearn, hts-nim-tools, htseq=0.9.1, htslib, impute2, kallisto>=0.43.1, kraken, lofreq, maxentscan, mbuffer, minimap2, mintmap, mirdeep2=2.0.0.7, mirtop, moreutils, mosdepth, multiqc, multiqc-bcbio, ngs-disambiguate, novoalign, octopus>=0.5.1b, oncofuse, parallel, pbgzip, perl-sanger-cgp-battenberg, picard, pindel, pizzly, preseq, pyloh, pysam>=0.14.0, pythonpy, qsignature, qualimap, rapmap, razers3=3.5.0, rtg-tools, sailfish, salmon, sambamba, samblaster, samtools, scalpel, seq2c, seqbuster, seqtk, sickle-trim, simple_sv_annotation, singlecell-barcodes, snap-aligner=1.0dev.97, snpeff=4.3.1t, solvebio, spades, star, stringtie, subread, survivor, tdrmapper, tophat-recondition, ucsc-bedgraphtobigwig, ucsc-bedtobigbed, ucsc-bigbedinfo, ucsc-bigbedsummary, ucsc-bigbedtobed, ucsc-bigwiginfo, ucsc-bigwigsummary, ucsc-bigwigtobedgraph, ucsc-bigwigtowig, ucsc-fatotwobit, ucsc-gtftogenepred, ucsc-liftover, ucsc-wigtobigwig, umis, vardict, vardict-java, variantbam, varscan, vcfanno, vcflib, verifybamid2, viennarna, vqsr_cnn, vt, wham, anaconda-client, awscli, bzip2, ncurses, nodejs, p7zip, readline, s3gof3r, xz, perl-app-cpanminus, perl-archive-extract, perl-archive-zip, perl-bio-db-sam, perl-cgi, perl-dbi, perl-encode-locale, perl-file-fetch, perl-file-sharedir, perl-file-sharedir-install, perl-ipc-system-simple, perl-lwp-protocol-https, perl-lwp-simple, perl-statistics-descriptive, perl-time-hires, perl-vcftools-vcf, bioconductor-annotate, bioconductor-biocgenerics, bioconductor-biocinstaller, bioconductor-biocstyle, bioconductor-biostrings, bioconductor-biovizbase, bioconductor-bsgenome.hsapiens.ucsc.hg19, bioconductor-bsgenome.hsapiens.ucsc.hg38, bioconductor-bubbletree, bioconductor-cn.mops, bioconductor-copynumber, bioconductor-degreport, bioconductor-deseq2, bioconductor-dexseq, bioconductor-dnacopy, bioconductor-genomeinfodbdata, bioconductor-genomicranges, bioconductor-iranges, bioconductor-limma, bioconductor-purecn, bioconductor-rtracklayer, bioconductor-titancna, r-base=3.4.1=h4fe35fd_8, r-basejump>=0.5.8, r-bcbiornaseq>=0.2.7, r-cghflasso, r-chbutils, r-devtools, r-dplyr, r-dt, r-ggdendro, r-ggplot2, r-ggrepel>=0.7, r-gplots, r-gsalib, r-knitr, r-pheatmap, r-plyr, r-pscbs, r-reshape, r-rmarkdown, r-rsqlite, r-sleuth, r-snow, r-stringi, r-viridis>=0.5, r-wasabi, r=3.4.1, xorg-libxt
Collecting package metadata: done
Solving environment: done
## Package Plan ##
environment location: /export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda
added / updated specs:
- age-metasv
- anaconda-client
- awscli
- bamtools=2.4.0
- bamutil
- bbmap
- bcbio-prioritize
- bcbio-rnaseq
- bcbio-variation
- bcbio-variation-recall
- bcftools
- bedtools=2.27.1
- bio-vcf
- biobambam
- bioconductor-annotate
- bioconductor-biocgenerics
- bioconductor-biocinstaller
- bioconductor-biocstyle
- bioconductor-biostrings
- bioconductor-biovizbase
- bioconductor-bsgenome.hsapiens.ucsc.hg19
- bioconductor-bsgenome.hsapiens.ucsc.hg38
- bioconductor-bubbletree
- bioconductor-cn.mops
- bioconductor-copynumber
- bioconductor-degreport
- bioconductor-deseq2
- bioconductor-dexseq
- bioconductor-dnacopy
- bioconductor-genomeinfodbdata
- bioconductor-genomicranges
- bioconductor-iranges
- bioconductor-limma
- bioconductor-purecn
- bioconductor-rtracklayer
- bioconductor-titancna
- bowtie
- bowtie2
- break-point-inspector
- bwa
- bwakit
- bzip2
- cage
- cancerit-allelecount
- chipseq-greylist
- cnvkit
- coincbc
- cramtools
- cufflinks
- cutadapt=1.16
- cyvcf2
- deeptools
- delly
- duphold[version='>=0.0.9']
- ensembl-vep=95
- express
- extract-sv-reads
- fastp
- fastqc[version='>=0.11.8',build=1]
- fgbio
- freebayes=1.1.0.46
- gatk
- gatk4
- geneimpacts
- genesplicer
- gffcompare
- goleft
- grabix
- gridss
- gsort
- gvcfgenotyper
- h5py
- hisat2
- hmftools-amber
- hmftools-cobalt
- hmftools-purple
- hmmlearn
- hts-nim-tools
- htseq=0.9.1
- htslib
- impute2
- kallisto[version='>=0.43.1']
- kraken
- lofreq
- maxentscan
- mbuffer
- minimap2
- mintmap
- mirdeep2=2.0.0.7
- mirtop
- moreutils
- mosdepth
- multiqc
- multiqc-bcbio
- ncurses
- ngs-disambiguate
- nodejs
- novoalign
- octopus[version='>=0.5.1b']
- oncofuse
- p7zip
- parallel
- pbgzip
- perl-app-cpanminus
- perl-archive-extract
- perl-archive-zip
- perl-bio-db-sam
- perl-cgi
- perl-dbi
- perl-encode-locale
- perl-file-fetch
- perl-file-sharedir
- perl-file-sharedir-install
- perl-ipc-system-simple
- perl-lwp-protocol-https
- perl-lwp-simple
- perl-sanger-cgp-battenberg
- perl-statistics-descriptive
- perl-time-hires
- perl-vcftools-vcf
- picard
- pindel
- pizzly
- preseq
- pyloh
- pysam[version='>=0.14.0']
- python=2
- pythonpy
- qsignature
- qualimap
- r-base==3.4.1=h4fe35fd_8
- r-basejump[version='>=0.5.8']
- r-bcbiornaseq[version='>=0.2.7']
- r-cghflasso
- r-chbutils
- r-devtools
- r-dplyr
- r-dt
- r-ggdendro
- r-ggplot2
- r-ggrepel[version='>=0.7']
- r-gplots
- r-gsalib
- r-knitr
- r-pheatmap
- r-plyr
- r-pscbs
- r-reshape
- r-rmarkdown
- r-rsqlite
- r-sleuth
- r-snow
- r-stringi
- r-viridis[version='>=0.5']
- r-wasabi
- r=3.4.1
- rapmap
- razers3=3.5.0
- readline
- rtg-tools
- s3gof3r
- sailfish
- salmon
- sambamba
- samblaster
- samtools
- scalpel
- seq2c
- seqbuster
- seqtk
- sickle-trim
- simple_sv_annotation
- singlecell-barcodes
- snap-aligner=1.0dev.97
- snpeff=4.3.1t
- solvebio
- spades
- star
- stringtie
- subread
- survivor
- tdrmapper
- tophat-recondition
- ucsc-bedgraphtobigwig
- ucsc-bedtobigbed
- ucsc-bigbedinfo
- ucsc-bigbedsummary
- ucsc-bigbedtobed
- ucsc-bigwiginfo
- ucsc-bigwigsummary
- ucsc-bigwigtobedgraph
- ucsc-bigwigtowig
- ucsc-fatotwobit
- ucsc-gtftogenepred
- ucsc-liftover
- ucsc-wigtobigwig
- umis
- vardict
- vardict-java
- variantbam
- varscan
- vcfanno
- vcflib
- verifybamid2
- viennarna
- vqsr_cnn
- vt
- wham
- xorg-libxt
- xz
The following NEW packages will be INSTALLED:
simple_sv_annotat~ bioconda/noarch::simple_sv_annotation-2019.02.18-py_0
The following packages will be UPDATED:
blas pkgs/main::blas-1.0-mkl --> conda-forge::blas-1.1-openblas
coincbc 2.9.8-1 --> 2.9.9-blas_openblash35bd731_1002
gsl pkgs/main::gsl-2.2.1-h0c605f7_3 --> conda-forge::gsl-2.2.1-blas_openblash47a8a8e_5
The following packages will be SUPERSEDED by a higher-priority channel:
numpy pkgs/main::numpy-1.15.4-py27h7e9f1db_0 --> conda-forge::numpy-1.15.2-py27_blas_openblashd3ea46f_1
scikit-learn pkgs/main::scikit-learn-0.20.2-py27hd~ --> conda-forge::scikit-learn-0.20.0-py27_blas_openblasha84fab4_201
scipy pkgs/main::scipy-1.2.0-py27h7c811a0_0 --> conda-forge::scipy-1.1.0-py27_blas_openblash7943236_201
The following packages will be DOWNGRADED:
numpy-base 1.15.4-py27hde5b4d6_0 --> 1.15.4-py27h2f8d375_0
Preparing transaction: done
Verifying transaction: done
Executing transaction: done
# Installing into conda environment dv: deepvariant
Collecting package metadata: done
Solving environment: done
# All requested packages already installed.
# Installing into conda environment python2: cpat, dkfz-bias-filter, gemini, gvcf-regions, hap.py, lumpy-sv, macs2, manta, metasv, mirge, optitype, peddy, phylowgs, platypus-variant, sentieon, smcounter2, smoove, strelka, svtools, svtyper, theta2, tophat, vawk, vcf2db
Collecting package metadata: done
Solving environment: done
# All requested packages already installed.
# Installing into conda environment python3: atropos, crossmap
Collecting package metadata: done
Solving environment: done
# All requested packages already installed.
# Installing into conda environment samtools0: ericscript
Collecting package metadata: done
Solving environment: done
# All requested packages already installed.
DBG [config.py]: Using config file /export/automated-VC-test/tmpbcbio-install/cloudbiolinux/cloudbio/../contrib/flavor/ngs_pipeline_minimal/r-libs.yaml
Creating manifest of installed packages in /export/bcbio_nextgen/manifest
Third party tools upgrade complete.
Upgrading bcbio-nextgen data files
--2019-02-19 14:07:54-- https://github.com/chapmanb/cloudbiolinux/archive/master.tar.gz
Resolving github.com... 140.82.118.4, 140.82.118.3
Connecting to github.com|140.82.118.4|:443... connected.
HTTP request sent, awaiting response... 302 Found
Location: https://codeload.github.com/chapmanb/cloudbiolinux/tar.gz/master [following]
--2019-02-19 14:07:55-- https://codeload.github.com/chapmanb/cloudbiolinux/tar.gz/master
Resolving codeload.github.com... 192.30.253.120, 192.30.253.121
Connecting to codeload.github.com|192.30.253.120|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [application/x-gzip]
Saving to: ‘STDOUT’
0K ........ ........ ........ ........ ........ ........ 2.93M
3072K ........ ........ ........ ....... 13.8M=1.2s
2019-02-19 14:07:56 (4.24 MB/s) - written to stdout [5178650]
Setting up virtual machine
INFO: <cloudbio.flavor.Flavor instance at 0x7f3b121d42d8>
INFO: <cloudbio.flavor.Flavor instance at 0x7f3b121d42d8>
INFO: This is a ngs_pipeline_minimal flavor
INFO: This is a ngs_pipeline_minimal flavor
INFO: Distribution __auto__
INFO: Distribution __auto__
INFO: Get local environment
INFO: Get local environment
INFO: CentOS setup
INFO: CentOS setup
WARN [distribution.py(206)]: NixPkgs are currently not supported for centos
WARN [distribution.py(206)]: NixPkgs are currently not supported for centos
DBG [distribution.py]: NixPkgs: Ignored
DBG [distribution.py]: NixPkgs: Ignored
[localhost] local: echo $HOME
[localhost] local: uname -m
INFO: Now, testing connection to host...
INFO: Now, testing connection to host...
INFO: Connection to host appears to work!
INFO: Connection to host appears to work!
DBG [utils.py]: Expand paths
DBG [utils.py]: Expand paths
List of genomes to get (from the config file at '{'install_liftover': False, 'genome_indexes': ['bwa', 'rtg'], 'genomes': [{'name': 'Human (hg38) full', 'validation': ['giab-NA12878', 'giab-NA24385', 'giab-NA24631', 'platinum-genome-NA12878', 'giab-NA12878-remap', 'giab-NA12878-crossmap', 'dream-syn4-crossmap', 'dream-syn3-crossmap', 'giab-NA12878-NA24385-somatic', 'giab-NA24143', 'giab-NA24149'], 'annotations': ['ccds', 'capture_regions', 'coverage', 'prioritize', 'dbsnp', 'hapmap_snps', '1000g_omni_snps', 'ACMG56_genes', '1000g_snps', 'mills_indels', '1000g_indels', 'clinvar', 'qsignature', 'genesplicer', 'effects_transcripts', 'varpon', 'vcfanno', 'viral'], 'dbkey': 'hg38', 'indexes': ['seq', 'twobit', 'bwa', 'hisat2']}], 'install_uniref': False}'): Human (hg38) full
bcbio-nextgen data upgrade complete.
Isolated tool installation not automatically added to environmental variables
Add:
/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/tools/bin to PATH
Upgrade completed successfully.
I've just thought of something that might be worth looking into.
I've made a parallel install of bcbio_nextgen on a cluster around the same time that I've created the problematic install that I mentioned here (which is on a cloud VM). I've just checked the cluster install now and the tools are all linked inside the bcbio_nextgen/tools/bin
directory, including strelka2 and platypus et al.
When setting up the VM paths, I created a list of directories and a symlink in order to "mirror" the directory structure from the cluster, so that commands and scripts would work seamlessly on both.
My home directory is /export/home/ncit/external/a.mizeranschi
. I had an "/export" mount point available on the VM and I had already copied a bunch of stuff there before getting access to the cluster, so I thought I'd create the /export/home/ncit/external/
directories and make a symlink there called a.mizeranschi
, which points to the parent directory /export
, such that the paths /export/<stuff>
from the VM and and /export/home/ncit/external/a.mizeranschi/<stuff>
from the cluster are analogous.
Could this setup with a smylink to a parent directory cause the problems I've been seeing? I haven't noticed any other strange things besides the issues with the tooldir in bcbio.
I've created a new bcbio install on the VM (where the symlink setup is) and noticed some errors during the install:
Installing bcbio-nextgen
Installing data and third party dependencies
Traceback (most recent call last):
File "bcbio_nextgen_install.py", line 287, in <module>
main(parser.parse_args(), sys.argv[1:])
File "bcbio_nextgen_install.py", line 44, in main
subprocess.check_call([bcbio, "upgrade"] + _clean_args(sys_argv, args))
File "/usr/lib64/python2.7/subprocess.py", line 542, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command '['/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda/bin/bcbio_nextgen.py', 'upgrade', '--tooldir=/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/tools', '--genomes', 'sacCer3', '--datatarget', 'variation', '--datatarget', 'rnaseq', '--datatarget', 'smallrna', '--aligners', 'bwa', '--aligners', 'bowtie2', '--aligners', 'hisat2', '--aligners', 'star', '--isolate', '--cores', '10', '--data']' returned non-zero exit status 1
-- Upgrading with the latest development code and installing CWL support
Upgrading bcbio
Traceback (most recent call last):
File "/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/tools/bin/bcbio_nextgen.py", line 221, in <module>
install.upgrade_bcbio(kwargs["args"])
File "/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda/lib/python2.7/site-packages/bcbio/install.py", line 56, in upgrade_bcbio
args = add_install_defaults(args)
File "/export/home/ncit/external/a.mizeranschi/bcbio_nextgen/anaconda/lib/python2.7/site-packages/bcbio/install.py", line 706, in add_install_defaults
raise ValueError("Default tool directory not yet saved in config defaults. "
ValueError: Default tool directory not yet saved in config defaults. Specify the '--tooldir=/path/to/tools' to upgrade tools. After a successful upgrade, the '--tools' parameter will work for future upgrades.
I ended up deleting the symlink, creating a directory instead and moving all the stuff there, as I probably should have done in the first place. After reinstalling bcbio_nextgen (with the same commands and paths), it finished alright and the tools are all linked inside bcbio_nextgen/tools/bin
.
Sorry for wasting your time with this.
Thanks for following up on this. Apologies, some deeper symlinking approaches might break the logic, and it looks like we didn't have a tooldir in this specific case so it failed and never ended up fixing the symlinks.
I appreciate you working around the problem for your case and for the detailed description in case anyone else hits the same issue. Thanks again for all the work debugging this.
Hello,
I'm getting several errors (below) when running joint VC with Strelka2 in bcbio-nextgen v.1.1.3 (stable). HaplotypeCaller works fine on my input data.
It also looks like Platypus isn't being found (if I try using it instead of HaplotypeCaller or Strelka2) and, indeed, it does look like it's missing from my Bcbio installation, although I remember that I've used it in the past -- has it been left out or is my bcbio installation incomplete?
Here is the YAML config file that I'm using:
And this is the outcome: