Closed jbedo closed 2 years ago
Thanks for the bug report. Could you try running on the problem region with a single thread and generate a debug log:
$ octopus -R ref.fa -I *.bam -o $out --bamout $evidence --fast --max-genotypes 1000 -T chr5:7299733-7496937 -N $normal --annotations AF --debug
Apologies for the long delay, we had a storage failure and I didn't have compute time to spare to track this down until now.
I'm actually having trouble reproducing this exact error. I am however consistently generating segfaults that look like this:
builder for '/nix/store/5bykn0gi4dyncpg9dpnn81far91447jj-bionix-octopus-callSomatic.drv' failed with exit code 139; last 10 log lines:
[2021-06-15 22:46:16] <INFO> position | completed | taken | ttc
[2021-06-15 22:46:16] <INFO> ------------------------------------------------------------------------
[2021-06-15 22:47:16] <INFO> chr9:550055 0.5% 59s 3h 17m
[2021-06-15 22:47:54] <INFO> chr9:3065110 1.0% 1m 37s 2h 41m
[2021-06-15 22:48:32] <INFO> chr9:1108165 1.5% 2m 15s 2h 28m
[2021-06-15 22:49:16] <INFO> chr9:1018931 2.0% 2m 59s 2h 26m
[2021-06-15 22:49:59] <INFO> chr9:3730990 2.5% 3m 42s 2h 24m
/nix/store/13i375llk80lry9g29szivsz9jap7d3p-stdenv-linux/setup: line 1290: 39190 Segmentation fault (core dumped) octopus -R ref.fa -I *.bam -o $out --bamout $evidence --threads=$NIX_BUILD_CORES --fast --max-genotypes 1000 -T chr9 -N $normal --annotations AF
srun: error: milton-med-008: task 0: Exited with exit code 139
salloc: Relinquishing job allocation 1613185
builder for '/nix/store/8kb9sh9j67vbiikdj66pdfzqaw6p1npk-bionix-octopus-callSomatic.drv' failed with exit code 134; last 10 log lines:
[2021-06-15 22:56:05] <INFO> chr12:2170464 1.5% 2m 3s 2h 15m
[2021-06-15 22:56:43] <INFO> chr12:961733 2.0% 2m 41s 2h 11m
[2021-06-15 22:57:27] <INFO> chr12:1144233 2.5% 3m 25s 2h 13m
[2021-06-15 22:58:14] <INFO> chr12:4503738 3.0% 4m 12s 2h 16m
[2021-06-15 22:59:00] <INFO> chr12:5467059 3.5% 4m 58s 2h 16m
[2021-06-15 22:59:49] <INFO> chr12:6163683 4.0% 5m 47s 2h 18m
free(): invalid next size (fast)
/nix/store/13i375llk80lry9g29szivsz9jap7d3p-stdenv-linux/setup: line 1290: 42798 Aborted (core dumped) octopus -R ref.fa -I *.bam -o $out --bamout $evidence --threads=$NIX_BUILD_CORES --fast --max-genotypes 1000 -T chr12 -N $normal --annotations AF
srun: error: milton-med-008: task 0: Exited with exit code 134
salloc: Relinquishing job allocation 1613194
builder for '/nix/store/2b8i8879qbq54s7cp0w1an0f82hf2nby-bionix-octopus-callSomatic.drv' failed with exit code 134; last 10 log lines:
[2021-06-15 23:27:12] <INFO> chr17:79069719 95.1% 42m 46s 2m 12s
[2021-06-15 23:27:53] <INFO> chr17:80051379 96.1% 43m 27s 1m 45s
[2021-06-15 23:28:40] <INFO> chr17:79823782 97.1% 44m 13s 1m 19s
[2021-06-15 23:29:27] <INFO> chr17:80672915 98.1% 45m 1s 52s
[2021-06-15 23:30:17] <INFO> chr17:80905142 99.1% 45m 51s 25s
[2021-06-15 23:33:32] <EROR> Encountered a problem whilst calling chr17:82999972-83163929()
free(): invalid next size (fast)
/nix/store/13i375llk80lry9g29szivsz9jap7d3p-stdenv-linux/setup: line 1290: 13672 Aborted (core dumped) octopus -R ref.fa -I *.bam -o $out --bamout $evidence --threads=$NIX_BUILD_CORES --fast --max-genotypes 1000 -T chr17 -N $normal --annotations AF
srun: error: milton-sml-018: task 0: Exited with exit code 134
salloc: Relinquishing job allocation 1613181
builder for '/nix/store/fkcpbx9wk6hw4izfkszlwv8sp8npggs0-bionix-octopus-callSomatic.drv' failed with exit code 134; last 10 log lines:
[2021-06-16 02:15:39] <INFO> chr6:44223934 26.5% 27m 52s 1h 17m
[2021-06-16 02:16:14] <INFO> chr6:47662002 27.0% 28m 26s 1h 16m
[2021-06-16 02:16:42] <INFO> chr6:46919042 27.5% 28m 55s 1h 16m
[2021-06-16 02:17:04] <INFO> chr6:46970072 28.0% 29m 17s 1h 15m
[2021-06-16 02:17:30] <INFO> chr6:48154109 28.5% 29m 43s 1h 14m
[2021-06-16 02:17:55] <INFO> chr6:48211700 29.0% 30m 7s 1h 13m
free(): invalid next size (fast)
/nix/store/13i375llk80lry9g29szivsz9jap7d3p-stdenv-linux/setup: line 1290: 14614 Aborted (core dumped) octopus -R ref.fa -I *.bam -o $out --bamout $evidence --threads=$NIX_BUILD_CORES --fast --max-genotypes 1000 -T chr6 -N $normal --annotations AF
I'm currently trying to generate a debug log for you that captures one of these, but since it appears a bit random rather than triggered by a specific region it's taking a while. Other than the debug log, would anything else help?
Here's some logs. stderr.log.gz octopus_debug_100000.log.gz
I had to limit the debug log to the last 100k lines as it was too large. If you need the whole log let me know.
This is probably #228.
Describe the bug Cancer model core dumps with std::out_of_range error for _Map_base::at. Unfortunately I can't share this data; any suggestions on capturing debug info that might help you track this down?
Version
Command Command line to install octopus: Using Nix package.
Command line to run octopus:
Additional context Add any other context about the problem here, e.g.