volcano-sh / volcano

A Cloud Native Batch System (Project under CNCF)
https://volcano.sh
Apache License 2.0
4.15k stars 956 forks source link

Will metrics for pods and jobs be enhanced? #3268

Open bysph opened 10 months ago

bysph commented 10 months ago

What would you like to be added:

Metrics like volcano_pod_info or volcano_job_info which can reflect the association between pod/job and queues.

Why is this needed:

Currently, Volcano lacks these types of metrics, which prevents us from viewing the list of jobs or the number of pods under a certain queue. I believe that supplementing these metrics would be very helpful for production resource monitoring.

bysph commented 10 months ago

@Monokaix This is what I mentioned to you on the phone, we can continue our discussion here : )

Monokaix commented 10 months ago

@Monokaix This is what I mentioned to you on the phone, we can continue our discussion here : )

Thanks for your advice, will check this later.

Monokaix commented 9 months ago

Hi, queue.status has pod number allocated to current queue and we can expose these information by metric.

status:
  allocated:
    cpu: "2"
    memory: "0"
    pods: 2m