Closed nick-youngblut closed 6 years ago
Can you add the log file for the following call?
Q(fx, x=1:3, n_jobs=1, log_worker=TRUE)
In general, I don't see how clustermq can cause segfaults, because it does not contain compiled code. Are you sure rzmq was installed correctly and that the example in its README works? This is likely an issue between ZeroMQ/rzmq and conda.
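One way to check rzmq on its own, independent of clustermq, is a local round trip between two sockets in a single R session. This is only a rough sketch: the port 5555 and the "ping"/"pong" payloads are arbitrary choices for illustration, not anything clustermq uses. If this segfaults as well, the problem is most likely in the ZeroMQ/rzmq install rather than in clustermq.

```r
library(rzmq)

# Assumption: port 5555 is free on this machine; any unused port would do.
endpoint <- "tcp://127.0.0.1:5555"

context <- init.context()

# One socket binds and answers requests, the other connects and sends them.
rep_socket <- init.socket(context, "ZMQ_REP")
bind.socket(rep_socket, endpoint)

req_socket <- init.socket(context, "ZMQ_REQ")
connect.socket(req_socket, endpoint)

# Round trip: send a request, read it on the other side, reply, read the reply.
send.socket(req_socket, data = "ping")
request <- receive.socket(rep_socket)
send.socket(rep_socket, data = "pong")
reply <- receive.socket(req_socket)

# Stops with an error if either message did not arrive intact.
stopifnot(identical(request, "ping"), identical(reply, "pong"))
```

Running this inside the same conda environment that the qsub job uses would show whether the rzmq build there is usable at all.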
Sorry for not being clear in my original post. The log file is:
/var/spool/gridengine/execd/node514/job_scripts/1191974: line 8: ulimit: virtual memory: cannot modify limit: Operation not permitted
WARNING: ignoring environment value of R_HOME
/var/spool/gridengine/execd/node514/job_scripts/1191974: line 9: 63852 Segmentation fault R --no-save --no-restore -e 'clustermq:::worker("tcp://rick.eb.local:7775")'
I haven't tried the rzmq example, but that could definitely be the problem.
I meant the clustermq worker log file, not the job log file.
@nick-youngblut Any news on the log file?
I'd expect to see at least the R startup messages, otherwise your R itself may be borked.
When I run this, I see in cmq6557.log:
R version 3.4.4 (2018-03-15) -- "Someone to Lean On"
Copyright (C) 2018 The R Foundation for Statistical Computing
[...]
> clustermq:::worker("tcp://pg-node062:6557")
Master: tcp://pg-node062:6557
WORKER_UP to: tcp://pg-node062:6557
> DO_SETUP (0.000s wait)
[...]
Sorry about the slow reply. I've been working on getting batchtools to work and haven't had time to look further into this yet. I'm guessing that the problem is rzmq, but I need to look more into the problem.
Closing this due to inactivity. Please reopen if the log shows this is a clustermq problem.
The Q function is stalling during the simple demo:
The progress bar stays at 0%. The qsub job starts and errors out. The error is the following:
My job template file:
SessionInfo:
I'm using R installed via conda. My conda env info is:
I had to install clustermq in R with install.packages(), because there's no conda package for clustermq.