apache / incubator-streampark

Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
https://streampark.apache.org/
Apache License 2.0

[Bug] When YARN is low on resources, it accepts the app submitted by StreamPark but leaves it in the ACCEPTED state. After a while, StreamPark reports a timeout but the app on YARN is not killed, so it starts running once YARN has resources, while the job status in StreamPark remains failed. #2574

Closed. FrankSheng closed this issue 1 year ago.

FrankSheng commented 1 year ago

Java Version

1.8

Scala Version

2.12.x

StreamPark Version

2.0.0

Flink Version

1.14

Deploy Mode

yarn-application

What happened

When YARN is low on resources, it accepts the application submitted by StreamPark but leaves it in the ACCEPTED state. After a while, StreamPark reports a timeout, but the application on YARN is not killed, so it starts running once YARN has resources again. The corresponding job in StreamPark still shows a failed status, so the two sides are out of sync.

Error Exception

java.util.concurrent.CompletionException: java.util.concurrent.TimeoutException
    at java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:292)
    at java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:308)
    at java.util.concurrent.CompletableFuture.orApply(CompletableFuture.java:1371)
    at java.util.concurrent.CompletableFuture$OrApply.tryFire(CompletableFuture.java:1350)
    at java.util.concurrent.CompletableFuture$CoCompletion.tryFire(CompletableFuture.java:1020)
    at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:474)
    at java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:1977)
    at org.apache.streampark.common.util.CompletableFutureUtils$$anon$1.call(CompletableFutureUtils.scala:46)
    at org.apache.streampark.common.util.CompletableFutureUtils$$anon$1.call(CompletableFutureUtils.scala:45)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:748)
Caused by: java.util.concurrent.TimeoutException
    ... 8 more
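
The trace above suggests the submission future is raced against a scheduled task that fails it with a `TimeoutException` (note `orApply` and the scheduled `call` frame). The following is a minimal sketch of that pattern in plain Java, with a cleanup hook that runs when the timeout wins. It is not StreamPark's actual code: the class name `SubmitTimeoutSketch`, the method `withTimeout`, and the `onTimeout` callback are hypothetical names introduced for illustration only.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CompletionException;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.function.Function;

// Hypothetical helper, not part of StreamPark.
final class SubmitTimeoutSketch {

    // Daemon timer thread so the sketch never blocks JVM shutdown.
    private static final ScheduledExecutorService TIMER =
        Executors.newSingleThreadScheduledExecutor(r -> {
            Thread t = new Thread(r, "submit-timeout-timer");
            t.setDaemon(true);
            return t;
        });

    /**
     * Races `work` against a timer. If the timer wins, the returned future
     * fails with a CompletionException caused by a TimeoutException, and
     * `onTimeout` runs first so the caller can clean up the pending submission.
     */
    public static <T> CompletableFuture<T> withTimeout(
            CompletableFuture<T> work, long delay, TimeUnit unit, Runnable onTimeout) {
        CompletableFuture<T> timeout = new CompletableFuture<>();
        TIMER.schedule(
            () -> timeout.completeExceptionally(new TimeoutException()), delay, unit);

        // Whichever future completes first decides the outcome.
        CompletableFuture<T> raced = work.applyToEither(timeout, Function.identity());
        return raced.whenComplete((result, error) -> {
            if (isTimeout(error)) {
                onTimeout.run();
            }
        });
    }

    // Unwraps the CompletionException that CompletableFuture adds around failures.
    private static boolean isTimeout(Throwable error) {
        Throwable cause = (error instanceof CompletionException) ? error.getCause() : error;
        return cause instanceof TimeoutException;
    }
}
```

In a setup like the one reported here, the `onTimeout` callback is where the still-ACCEPTED application could be killed, for example through Hadoop's `YarnClient.killApplication(ApplicationId)`, assuming the application ID is already known when the timeout fires. Without such a step, the timeout only fails the client-side future and leaves the YARN application queued.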

Screenshots

No response

wolfboys commented 1 year ago

Can you rewrite the issue title?