apache / incubator-streampark

Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
https://streampark.apache.org/
Apache License 2.0
3.8k stars 978 forks source link

[Bug] After the task status tracking is lost, remapping remains suspended #2689

Open lukeyan2023 opened 1 year ago

lukeyan2023 commented 1 year ago

Search before asking

Java Version

openjdk version "1.8.0_362" OpenJDK Runtime Environment (Zulu 8.68.0.21-CA-linux64) (build 1.8.0_362-b09) OpenJDK 64-Bit Server VM (Zulu 8.68.0.21-CA-linux64) (build 25.362-b09, mixed mode)

Scala Version

2.12.x

StreamPark Version

2.0.0

Flink Version

1.14.6

Flink 环境为 高可用 StandAlone集群

deploy mode

remote

What happened

After Flink task runs for a period of time, Streamark task running status tracking is lost:

image

The remapping was unsuccessful and remained in the mapping state:

image

Error Exception

After executing remapping, the logs are as follows:

2023-04-23 17:48:15 | INFO  | XNIO-1 task-6 | org.apache.streampark.console.core.task.FlinkRESTAPIWatcher:593] FlinkRESTAPIWatcher add app to tracking,appId:10009
2023-04-23 17:52:05 | INFO  | XNIO-1 task-5 | org.apache.streampark.console.core.task.FlinkRESTAPIWatcher:593] FlinkRESTAPIWatcher add app to tracking,appId:10008

Screenshots

The following is the Streamark task monitoring interface;

image

Below is the running status of FLink cluster tasks:

1682302841713

Are you willing to submit PR?

Code of Conduct

lukeyan2023 commented 10 months ago

Recently tested in Flink 1.17.1 standalone HA cluster environment, there is still a problem that remapping cannot be successful.

As shown below, it is always in mapping

image