Closed: meua closed this issue 1 month ago
Can I ask why you are killing the daemon?
We do not support auto-restarting the daemon at the moment.
Because custom nodes and operators can, for various reasons, cause dora-daemon to hang unexpectedly. I kill the dora-daemon process to simulate this situation.
Do you have any ideas or context you can share about why dora-daemon hangs unexpectedly?
I am not running in source debug mode. After running dora up and then
RUST_LOG=true dora start graphs/tutorials/webcam.yaml --attach --hot-reload --name webcam
the dataflow cannot be stopped:
(dora3.7) jarvis@jia:~/coding/pyhome/github.com/dora-rs/dora-drives$ dora list
Running dataflows:
- [webcam] 2eeba0b6-4cfa-438a-bc7f-0747664e06f3
(dora3.7) jarvis@jia:~/coding/pyhome/github.com/dora-rs/dora-drives$ dora stop
> Choose dataflow to stop: [webcam] 2eeba0b6-4cfa-438a-bc7f-0747664e06f3
(dora3.7) jarvis@jia:~/coding/pyhome/github.com/dora-rs/dora-drives$ dora list
Running dataflows:
- [webcam] 2eeba0b6-4cfa-438a-bc7f-0747664e06f3
(dora3.7) jarvis@jia:~/coding/pyhome/github.com/dora-rs/dora-drives$ dora -V
dora-cli 0.2.3
(dora3.7) jarvis@jia:~/coding/pyhome/github.com/dora-rs/dora-drives$ dora logs 2eeba0b6-4cfa-438a-bc7f-0747664e06f3 webcam
> │ Logs from webcam.
─────┼─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
1 │ could not get webcam.
2 │ could not get webcam.
3 │ could not get webcam.
4 │ could not get webcam.
5 │ could not get webcam.
6 │ could not get webcam.
7 │ could not get webcam.
8 │ could not get webcam.
9 │ could not get webcam.
10 │ could not get webcam.
11 │ could not get webcam.
12 │ could not get webcam.
13 │ could not get webcam.
14 │ could not get webcam.
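The transcript above shows the dataflow still listed after dora stop. A minimal sketch of how that check could be automated follows, assuming the list output keeps the "- [name] uuid" line format seen in this thread (it may differ in other dora-cli versions). The function names parse_running_dataflows and stop_took_effect are hypothetical helpers, not part of dora.

```python
import re

# Matches lines such as "- [webcam] 2eeba0b6-4cfa-438a-bc7f-0747664e06f3".
# Assumption: this format is taken from the `dora list` output quoted in
# this thread (dora-cli 0.2.3) and is not a documented, stable interface.
_DATAFLOW_LINE = re.compile(
    r"^-\s+\[(?P<name>[^\]]*)\]\s+(?P<uuid>[0-9a-fA-F-]{36})\s*$"
)


def parse_running_dataflows(list_output: str) -> dict:
    """Return a {uuid: name} mapping parsed from captured `dora list` text."""
    dataflows = {}
    for line in list_output.splitlines():
        match = _DATAFLOW_LINE.match(line.strip())
        if match:
            dataflows[match.group("uuid")] = match.group("name")
    return dataflows


def stop_took_effect(uuid: str, list_output_after_stop: str) -> bool:
    """True if the dataflow is absent from the list captured after the stop."""
    return uuid not in parse_running_dataflows(list_output_after_stop)
```

Applied to the second dora list output in the transcript, stop_took_effect would report that the stop did not take effect, which matches the behavior described here.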
This should have been fixed by the grace duration.
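For readers unfamiliar with the idea, here is a rough sketch of what a grace duration generally means: ask a process to exit, wait a bounded time, then force-kill it. This is an illustration of the general pattern using Python's standard subprocess module, not dora's actual implementation. The function name stop_with_grace is hypothetical.

```python
import subprocess
import sys


def stop_with_grace(proc: subprocess.Popen, grace_seconds: float) -> str:
    """Ask a process to exit, then force-kill it if the grace period expires.

    Returns "graceful" if the process exited within the grace period,
    or "killed" if it had to be force-killed.
    """
    proc.terminate()  # SIGTERM on POSIX
    try:
        proc.wait(timeout=grace_seconds)
        return "graceful"
    except subprocess.TimeoutExpired:
        proc.kill()  # SIGKILL on POSIX
        proc.wait()  # reap the process so it does not linger as a zombie
        return "killed"


if __name__ == "__main__":
    # Stand-in for a node process: a child that just sleeps.
    child = subprocess.Popen(
        [sys.executable, "-c", "import time; time.sleep(60)"]
    )
    print(stop_with_grace(child, grace_seconds=2.0))
```

The point of the pattern is that a node stuck in a loop (such as the webcam node repeatedly logging an error) can no longer block a stop request forever.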
Describe the bug
dora-daemon hangs due to a heartbeat timeout while dora-coordinator keeps running normally. After I restart dora-daemon, stopping the dataflow with dora stop uuid has no effect: the dataflow cannot be closed.
Open a new terminal and kill dora-daemon to simulate the daemon process hanging abnormally.
To Reproduce
Steps to reproduce the behavior:
dora-cli up
dora-cli start examples/python-operator-dataflow/dataflow.yaml --attach --hot-reload
kill -15 pid_dora_daemon
dora-cli up
dora-cli stop uuid_your_dataflow
Expected behavior
I expect dora-coordinator and dora-daemon to live and die together and to restart automatically when the heartbeat times out. Alternatively, when dora-daemon hangs, the dataflow should be destroyed as well.
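To make the expected behavior concrete, here is a minimal sketch of heartbeat-timeout detection: a monitor records the last heartbeat per daemon and reports the daemons whose heartbeat is overdue, so that a supervisor could restart them or tear down their dataflows. This does not reflect how dora-coordinator is implemented. The class HeartbeatMonitor, its methods, and the daemon identifiers are all hypothetical.

```python
import time
from typing import Callable, Dict, List


class HeartbeatMonitor:
    """Track the last heartbeat per daemon and report the ones that timed out."""

    def __init__(
        self,
        timeout_seconds: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.timeout_seconds = timeout_seconds
        self._clock = clock  # injectable so the logic can be tested without sleeping
        self._last_seen: Dict[str, float] = {}

    def record_heartbeat(self, daemon_id: str) -> None:
        """Note that a heartbeat from this daemon arrived just now."""
        self._last_seen[daemon_id] = self._clock()

    def timed_out(self) -> List[str]:
        """Return the sorted ids of daemons whose last heartbeat is too old."""
        now = self._clock()
        return sorted(
            daemon_id
            for daemon_id, seen in self._last_seen.items()
            if now - seen > self.timeout_seconds
        )

    def forget(self, daemon_id: str) -> None:
        """Stop tracking a daemon, for example after its dataflows were cleaned up."""
        self._last_seen.pop(daemon_id, None)
```

A supervisor loop would call timed_out() periodically and, for each returned id, either restart the daemon or destroy its dataflows and then call forget(), which corresponds to the two outcomes requested above.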
Environments (please complete the following information):