flink-extended / flink-remote-shuffle

Remote Shuffle Service for Flink
Apache License 2.0
191 stars 56 forks source link

Add more information about the ShuffleWorker in the ShuffleManagerClient #71

Open Aitozi opened 2 years ago

Aitozi commented 2 years ago

By default, the shuffleManagerClient will print the log as below. But the worker Instance id is not human readable. It can not quick link to the target ShuffleWorker. We need to check the ShuffleManager.log first and then find the target problem worker's ip

2022-05-17 17:10:28,860 INFO  [shuffle-client-JobID{ID=F3918B137043C8D49EBDDBF68CD9D923}] com.alibaba.flink.shuffle.client.ShuffleManagerClientImpl    [] - Got unrelated shuffle worker: InstanceID{ID=31312E3132342E3137372E3136303A33353239392D43314436303544373745364330423843}
2022-05-17 17:10:28,860 INFO  [shuffle-client-JobID{ID=F3918B137043C8D49EBDDBF68CD9D923}] com.alibaba.flink.shuffle.client.ShuffleManagerClientImpl    [] - Got newly related shuffle worker: InstanceID{ID=31312E3132342E3136312E3232393A34353932352D43393734413030463338383239303638}