Closed DOH-JDJ0303 closed 7 months ago
Looks like it is trying to fit the model with RAxML
, maybe try --model-fitter iqtree
?
Hi @nickjcroucher. Oops, yep it looks like there was an error in my script that was still allowing RAxML
to be called but that does not appear to be the source of the issue. I re-ran everything using IQTREE
and I am still getting the same error, which again appears to be an issue with Pyjar
.
Note: I removed some of the output because it contained sensitive sample names
IQ-TREE multicore version 2.2.5 COVID-edition for Linux 64-bit built Sep 15 2023 Developed by Bui Quang Minh, James Barbetti, Nguyen Lam Tung, Olga Chernomor, Heiko Schmidt, Dominik Schrempf, Michael Woodhams, Ly Trong Nhan.
Host: ip-172-31-7-51.us-west-2.compute.internal (AVX2, FMA3, 31 GB RAM) Command: iqtree -nt 8 -safe -redo -m GTR+G4 -s 1712094029-Acinetobacter_baumannii-00001.clean.aln.snp_sites.aln -t /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.iteration_1.tre.rooted --prefix /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.iteration_1 -n 0 --mlrate -redo Seed: 793205 (Using SPRNG - Scalable Parallel Random Number Generator) Time: Tue Apr 2 22:02:10 2024 Kernel: Safe AVX+FMA - 8 threads (8 CPU cores detected)
Reading alignment file 1712094029-Acinetobacter_baumannii-00001.clean.aln.snp_sites.aln ... Fasta format detected Reading fasta file: done in 0.093331 secs using 86.53% CPU Alignment most likely contains DNA/RNA sequences Constructing alignment: done in 0.111174 secs using 86.4% CPU Alignment has 105 sequences with 35458 columns, 10918 distinct patterns 22089 parsimony-informative, 13369 singleton sites, 0 constant sites Gap/Ambiguity Composition p-value Analyzing sequences: done in 0.00187732 secs using 172.2% CPU
[ REMOVED SENSITIVE INFO]
**** TOTAL 7.04% 0 sequences failed composition chi2 test (p-value<5%; df=3) Checking for duplicate sequences: done in 0.00864522 secs using 263% CPU Reading input tree file /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.iteration_1.tre.rooted ... rooted tree
NOTE: 148 MB RAM (0 GB) is required! Estimate model parameters (epsilon = 0.010)
Total number of iterations: 0 CPU time used for tree search: 0.000 sec (0h:0m:0s) Wall-clock time used for tree search: 0.000 sec (0h:0m:0s) Total CPU time used: 107.623 sec (0h:1m:47s) Total wall-clock time used: 13.924 sec (0h:0m:13s)
Analysis results written to: IQ-TREE report: /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.iteration_1.iqtree Maximum-likelihood tree: /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.iteration_1.treefile Site-specific rates: /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.iteration_1.rate Screen log file: /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.iteration_1.log
Date and Time: Tue Apr 2 22:02:25 2024
--- Gubbins 3.3.1 ---
Croucher N. J., Page A. J., Connor T. R., Delaney A. J., Keane J. A., Bentley S. D., Parkhill J., Harris S.R. "Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins". Nucleic Acids Res. 2015 Feb 18;43(3):e15. doi: 10.1093/nar/gku1196.
Checking dependencies and input files...
Checking input alignment file...
Filtering input alignment... ...done. Run time: 22.07 s
Running Gubbins to detect SNPs... gubbins /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.aln ...done. Run time: 37.67 s
Entering the main loop.
Iteration 1
Constructing the phylogenetic tree with iqtree... iqtree -nt 8 -safe -redo -m GTR+I+G -s /tmp/nxf.XXXXUvAFIB/1712094029-Acinetobacter_baumannii-00001.clean.aln.phylip -pre 1712094029-Acinetobacter_baumannii-00001.clean.iteration_1 -quiet ...done. Run time: 322.68 s
Reconstructing ancestral sequences with pyjar...
Fitting substitution model to tree... iqtree -nt 8 -safe -redo -m GTR+G4 -s 1712094029-Acinetobacter_baumannii-00001.clean.aln.snp_sites.aln -t /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.iteration_1.tre.rooted --prefix /tmp/nxf.XXXXUvAFIB/tmpgp_7rpzw/1712094029-Acinetobacter_baumannii-00001.clean.iteration_1 -n 0 --mlrate -redo
Running joint ancestral reconstruction with pyjar .command.sh: line 7: 159 Bus error (core dumped) run_gubbins.py --threads 8 --prefix 1712094029-Acinetobacter_baumannii-00001 --model-fitter iqtree --tree-builder iqtree --custom-model GTR+I+G 1712094029-Acinetobacter_baumannii-00001.clean.aln
The gamma distribution alpha parameter is still rising to almost ~1,000, which makes me suspect there are some hypervariable sites in your alignment that are difficult to include in a joint reconstruction. I would recommend taking a look at the data to check if there's anything odd about your alignment, but if you're confident it is correct, I would switch to a marginal reconstruction with the --mar
flag.
Thanks for the suggestion. What I failed to mention before was that this is running within Nextflow, which appears to be important because I was able to run Gubbins on the same alignment when used outside of Nextflow. I will circle back with a solution once I find one. Thanks for your help!
Thanks for letting me know - please reopen if there is an issue on the Gubbins side.
Hi @nickjcroucher. I wanted to follow up on this issue. Turns out it was not a Nextflow problem but a containerization problem. It seems pyjar
requires more shared memory than what is allocated by Docker/Podman/Singularity by default (I believe all are 64 MB?). Below are examples of how to increase the shared memory size:
--shm-size 1g
to your Docker/Podman command.sessiondir max size = 1000
in /usr/local/etc/singularity/singularity.conf
The specific resource value needed may change but these worked for my applications.
Thanks @DOH-JDJ0303, that's very helpful - hope everything's working now!
Hello. Pyjar appears to be causing a core dump. This may be related to this issue. This was run on the cleaned output of Snippy from 104 A. baumannii isolates. Also tried with 16 cores with 32 GB memory and got the same issue. I have not tried with the most recent version due to ongoing errors with Numba.
version: 3.3.1 container: https://depot.galaxyproject.org/singularity/gubbins%3A3.3.1--py39pl5321h3d4b85c_0 command:
error: