-
I am having an issue: when dmtcp tries to restart, it fails with the following error:
dmtcp_coordinator starting...
Host: hero36 (10.5.40.36)
Port: 46201
Checkpoint Interval: 1800
Exit on…
-
Hello,
I run dmtcp from the master branch (98756ac092aa600) in an interactive SLURM allocation. In one console I have the dmtcp coordinator; in another console I execute dmtcp_launch as follows:
dmtcp_laun…
-
I get an error when I try to create a checkpoint from my application.
I'm using CUDA 11.4 and TensorRT 8.4 in my application.
My platform is NVIDIA Jetson Xavier NX.
ARM®v8.2 64
Ubuntu 20.04.…
-
./test in this case is just a binary with "while(1);" sitting in a loop, just to see what falls out without any complex interactions
gourry@RagudoMezegis:~/git/dmtcp/bin$ valgrind --leak-check=full…
-
Currently, the following works in DMTCP:
```
LD_LIBRARY_PATH=$DMTCP_ROOT/lib/dmtcp dmtcp_launch --with-plugin libmodify-env.so a.out
```
Since DMTCP already knows about the directory `$DMTCP_ROOT/lib…
-
My job kept resubmitting itself, so I checked the logs. Here's a snippet from `$VSC_SCRATCH/chkpt//csub_test.sh.20180911_150818.eg.base.err.all`:
```
real 0m0.536s
user 0m0.020s
sys 0m0…
-
The dmtcp_launch(1) man page DESCRIPTION section is currently:
```
DESCRIPTION
dmtcp_launch launches a process under DMTCP control.
A typical usage is:
rm ckpt_a.out_*.dm…
-
I am very new to DMTCP (version 2.4.4). Maybe I am making some very silly mistakes in the following script, which is just to test DMTCP and MPI for checkpointing and restart. The checkpoint images are…
-
Is there a way to have the data persist after container updates / restarts?
-
http://dmtcp.sourceforge.net/dmtcp-mug-17.pdf and also http://dmtcp.sourceforge.net/ speak of the OpenGL DMTCP plugin. The entire publication can be found here https://arxiv.org/abs/1312.6650 and it s…