apache / dolphinscheduler

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
https://dolphinscheduler.apache.org/
Apache License 2.0
12.39k stars 4.49k forks source link

[Bug] [service down] service down and stop when use `bash bin/start-all.sh` a few seconds #16174

Closed ArkURL closed 3 days ago

ArkURL commented 1 week ago

Search before asking

What happened

I use cluster deploy to deploy dolphinscheduler service to 2 server. server and service running env below: os: centos7 db: postgresql-15.7 zookeeper: 3.7.2 python: python3.6 java: java-11

I deploy the service dolphinscheduler to 2 server and make them both be master-server and worker-server. when use bash bin/start-all.sh, every service work well and run perfect, but a few second later, some master-server and worker-server went down, but some workflow still can run.

at the first time every server work well. image

few seconds later, some server down but some workflow can still run. image

how to solve it? here is two server's master-server log file. dolphinscheduler-master.log dolphinscheduler-master_2.log

What you expected to happen

service work well and not down. dolphinscheduler-master_2.log dolphinscheduler-master.log

How to reproduce

use cluster deploy and make them all master-server and worker-server.then start the service.

Anything else

No response

Version

3.2.x

Are you willing to submit PR?

Code of Conduct

ruanwenjun commented 1 week ago

It's better to deprecated below six scripts, these scripts is not settled and I've rarely seen a project that offers a distributed deployment. @SbloodyS

image
SbloodyS commented 1 week ago

It's better to deprecated below six scripts, these scripts is not settled and I've rarely seen a project that offers a distributed deployment. @SbloodyS

image

+1

I'll remove it in a few days.

ruanwenjun commented 1 week ago

It's better to deprecated below six scripts, these scripts is not settled and I've rarely seen a project that offers a distributed deployment. @SbloodyS

image

+1

I'll remove it in a few days.

Thanks.

ruanwenjun commented 3 days ago

@ArkURL Thanks for your report this problem, we plan to remove these kind of script, this means you should start the server in remote server after ssh login.