Open tgebo opened 2 years ago
@yashpatel6 we can add some comments about the fix for the record.
I think the root cause may be related to max user processes
, which is 4096 as default. I had a similar issue before with hatchet. (OpenBLAS and I had to add some extra env variables to adjust # threads) If we see this issue with different tools and want to increate the ulimit, we'll have to ask OHIA or we may need to adjust the number of intervals/jobs running at the same time.
See max user processes
below.
(base) [tyamaguchi@ip-0A12521D CN_20]$ ulimit -a
core file size (blocks, -c) 0
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 15068
max locked memory (kbytes, -l) unlimited
max memory size (kbytes, -m) 3145728
open files (-n) 131072
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority (-r) 0
stack size (kbytes, -s) unlimited
cpu time (seconds, -t) unlimited
max user processes (-u) 4096
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited
@yashpatel6 we can add some comments about the fix for the record.
I think the root cause may be related to
max user processes
, which is 4096 as default. I had a similar issue before with hatchet. (OpenBLAS and I had to add some extra env variables to adjust # threads) If we see this issue with different tools and want to increate the ulimit, we'll have to ask OHIA or we may need to adjust the number of intervals/jobs running at the same time.See
max user processes
below.(base) [tyamaguchi@ip-0A12521D CN_20]$ ulimit -a core file size (blocks, -c) 0 data seg size (kbytes, -d) unlimited scheduling priority (-e) 0 file size (blocks, -f) unlimited pending signals (-i) 15068 max locked memory (kbytes, -l) unlimited max memory size (kbytes, -m) 3145728 open files (-n) 131072 pipe size (512 bytes, -p) 8 POSIX message queues (bytes, -q) 819200 real-time priority (-r) 0 stack size (kbytes, -s) unlimited cpu time (seconds, -t) unlimited max user processes (-u) 4096 virtual memory (kbytes, -v) unlimited file locks (-x) unlimited
Got it, I think part of the reason is also the scratch space running out due to ApplyBQSR
being parallelized and the pipelines having to wait for both Indel Realignment and BQSR to complete before deleting files. I've tried lowering the number of split intervals but the disk space issue causes the pipeline to fail so once I add the fix for processing the normal and tumour BQSR together, I'll test it again and see if the same issue pops up again.
v7.2.0
Slurm-Dev
F72
python
.py script
/hot/users/tgebo/WCDT/scripts/call-gSNP/DTB-005.config
/hot/users/tgebo/pipelines/pipeline-call-gSNP
/hot/users/tgebo/pipelines/pipeline-call-gSNP/DTB-005.log
* Changed parameter from previous run in #49 back to default value: scatter_count = 50**