volcano-sh / volcano

A Cloud Native Batch System (Project under CNCF)
https://volcano.sh
Apache License 2.0

Cannot be rescheduled after PodEvicted #3705

Open jorahbi opened 2 months ago

jorahbi commented 2 months ago

Description

When the controller pod is evicted, the scheduler's informer does not receive add or update events, so the pod is never rescheduled. The job remains pending until the scheduler is restarted.

Steps to reproduce the issue

Kubernetes version: 1.27

version controller.log vc-controllers scheduler.log vc-scheduler

Describe the results you received and expected

After a PodEvicted event, the scheduler should receive add and update events so that the pod can be rescheduled. Instead, no events arrive and the job stays pending.

What version of Volcano are you using?

v1.9.0

Any other relevant information

No response

Monokaix commented 2 months ago

Hi, can you provide more information about why the pod was evicted, along with the vcjob and pod YAML and status?