kubewharf / godel-scheduler

a unified scheduler for online and offline tasks
Apache License 2.0

fix event message #7

Closed lzlaa closed 6 months ago

lzlaa commented 6 months ago
  1. Fix the negative pod counts in the event message, and remove the unit failure attribution from the event message, e.g. [UnexpectedError:5].
# before:
Warning  FailToScheduleUnit  2s    godel-scheduler     Failed to schedule unit. unit message:uint key=PodGroupUnit/default/fifo-podgroup-0, ever scheduled=false, allMember=5, minMember=5; failure message:allPods=0, unHandledPods=-5, successfulPods=0, failedPods=5 [UnexpectedError:5] 

# after: 
Warning  FailToScheduleUnit  1s    godel-scheduler     Failed to schedule unit. unit message:uint key=PodGroupUnit/default/fifo-podgroup-0, ever scheduled=false, allMember=5, minMember=5; failure message:allPods=5, unHandledPods=0, successfulPods=0, failedPods=5
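
The "before" output is consistent with unHandledPods being derived as allPods - successfulPods - failedPods while allPods was still 0 (0 - 0 - 5 = -5). The following is a minimal sketch of that arithmetic with a guard against negative values. The type `unitFailureCounts` and the function `newUnitFailureCounts` are hypothetical names used for illustration only and are not taken from the godel-scheduler source.

```go
package main

import "fmt"

// unitFailureCounts is a hypothetical summary of how the pods of one
// scheduling unit fared; it is not the scheduler's real type.
type unitFailureCounts struct {
	All, Unhandled, Successful, Failed int
}

// newUnitFailureCounts derives the unhandled count from the total.
// Assumption: the caller passes the real number of pods in the unit as `all`.
// If `all` is left at zero, the subtraction goes negative, so it is clamped.
func newUnitFailureCounts(all, successful, failed int) unitFailureCounts {
	unhandled := all - successful - failed
	if unhandled < 0 {
		unhandled = 0
	}
	return unitFailureCounts{All: all, Unhandled: unhandled, Successful: successful, Failed: failed}
}

// String formats the counts in the same layout as the event message above.
func (c unitFailureCounts) String() string {
	return fmt.Sprintf("allPods=%d, unHandledPods=%d, successfulPods=%d, failedPods=%d",
		c.All, c.Unhandled, c.Successful, c.Failed)
}

func main() {
	// Five pods in the unit, all of them failed.
	fmt.Println(newUnitFailureCounts(5, 0, 5))
}
```

With the total set correctly, the derived value matches the "after" message; the clamp only protects the message if the total is ever missing again.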
  1. Fix the case where the default node group key and the node circle key appear as empty strings in the event message. Node group keys and node circle keys are now wrapped in [] so that an empty key is still visible in the message.
# before  
 Failed to schedule pod in node group: , error: RunPreFilterPlugins faied, error: Failed to run PreFilter plugin "NonNativeTopology" for pod "fifo-deploy-0-56798cd6c6-58mf2": featuregate NonNativeResourceSchedulingSupport is disabled

# after
Failed to schedule pod in node group: [], error: RunPreFilterPlugins faied, error: Failed to run PreFilter plugin "NonNativeTopology" for pod "example-7db669785b-4jvdb": featuregate NonNativeResourceSchedulingSupport is disabled
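
A minimal sketch of the bracketing idea, assuming the message is assembled with `fmt.Sprintf`. The helper `nodeGroupFailureMessage` is a hypothetical name for illustration; the actual formatting code in the PR may be structured differently.

```go
package main

import "fmt"

// nodeGroupFailureMessage is a hypothetical helper, not the scheduler's API.
// Wrapping the key in [] keeps the message unambiguous when the key is the
// empty string, as it is for the default node group.
func nodeGroupFailureMessage(nodeGroupKey, reason string) string {
	return fmt.Sprintf("Failed to schedule pod in node group: [%s], error: %s", nodeGroupKey, reason)
}

func main() {
	// The default node group has an empty key; the brackets still show it.
	fmt.Println(nodeGroupFailureMessage("", "example error"))
}
```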
  1. Fix the binder creating a "reject failed tasks" event even when there are no failed tasks.
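
The last fix amounts to an early return before the event is recorded. Below is a minimal sketch under stated assumptions: `eventRecorder`, `memoryRecorder`, `reportRejectedTasks`, and the event reason string `RejectFailedTasks` are all hypothetical stand-ins, not the binder's real types or reason names.

```go
package main

import "fmt"

// eventRecorder is a hypothetical stand-in for whatever the binder uses to
// emit events; it is not the Kubernetes or godel-scheduler interface.
type eventRecorder interface {
	Warn(reason, message string)
}

// memoryRecorder collects events in memory so the behavior can be inspected.
type memoryRecorder struct {
	events []string
}

func (m *memoryRecorder) Warn(reason, message string) {
	m.events = append(m.events, reason+": "+message)
}

// reportRejectedTasks emits one event for the rejected tasks, and nothing at
// all when the list is empty. The reason string is an assumption.
func reportRejectedTasks(rec eventRecorder, failedTasks []string) {
	if len(failedTasks) == 0 {
		return // no failed tasks, so no event
	}
	rec.Warn("RejectFailedTasks",
		fmt.Sprintf("rejected %d failed tasks: %v", len(failedTasks), failedTasks))
}

func main() {
	rec := &memoryRecorder{}
	reportRejectedTasks(rec, nil)                       // records nothing
	reportRejectedTasks(rec, []string{"default/pod-a"}) // records one event
	fmt.Println(len(rec.events))
}
```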