Closed. AlexLov closed this 1 month ago
Hi @AlexLov, do the instructions here work for you? https://docs.robusta.dev/master/playbook-reference/actions/scans.html#taints-tolerations-and-nodeselectors
Oh, I somehow overlooked this page :( Sure, it should do the trick for me. Sorry for the inconvenience.
All good! Any idea where you looked in the docs/github? I'll make sure we add a link so it is more discoverable.
I looked first into the chart's values.yaml and then directly in the code. I checked the docs a while ago and didn't see this page (or just didn't go that deep then).
Maybe placing the page above the * Troubleshooting pages in the list would make it more visible. For me, these troubleshooting pages and anything beyond them are kind of advanced stuff that is needed only occasionally, so there's no need to dig deep until it's really needed.
Is your feature request related to a problem? My KRR pods are often OOM-killed in some big clusters (3000+ pods). I can adjust the memory request/limit of that pod, but it also gets scheduled on fairly packed nodes dedicated to the main workload, and these memory adjustments might interfere with it. For side workloads such as monitoring and related stuff (like robusta), I have dedicated nodes with enough resources, so they won't interfere with the main workload even if they consume all of a node's resources. I use nodeSelectors and tolerations to run all my services on these dedicated nodes and to prevent the main cluster workload from being scheduled there.
Describe the solution you'd like Please add options to configure nodeSelector and tolerations for the KRR job, or at least let them be taken from the robusta-runner pod itself.
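To illustrate the second option (inheriting from the runner pod), here is a minimal sketch using plain Python dicts shaped like Kubernetes manifests. It is not Robusta's actual implementation: the function name `inherit_scheduling`, the `dedicated: monitoring` label and taint, the job name, and the image are all hypothetical. Only the `nodeSelector` and `tolerations` pod spec fields are standard Kubernetes.

```python
import copy


def inherit_scheduling(runner_pod_spec: dict, job_manifest: dict) -> dict:
    """Return a copy of job_manifest whose pod template reuses the
    runner pod's nodeSelector and tolerations (hypothetical helper)."""
    result = copy.deepcopy(job_manifest)
    # A Job's pod spec lives under spec.template.spec.
    pod_spec = (
        result.setdefault("spec", {})
        .setdefault("template", {})
        .setdefault("spec", {})
    )
    for field in ("nodeSelector", "tolerations"):
        if field in runner_pod_spec:
            pod_spec[field] = copy.deepcopy(runner_pod_spec[field])
    return result


# Assumed scheduling settings of the robusta-runner pod (illustrative values).
runner_pod_spec = {
    "nodeSelector": {"dedicated": "monitoring"},
    "tolerations": [
        {
            "key": "dedicated",
            "operator": "Equal",
            "value": "monitoring",
            "effect": "NoSchedule",
        }
    ],
}

# Assumed shape of the scan Job before scheduling settings are applied.
krr_job = {
    "apiVersion": "batch/v1",
    "kind": "Job",
    "metadata": {"name": "krr-scan"},
    "spec": {
        "template": {
            "spec": {
                "restartPolicy": "Never",
                "containers": [{"name": "krr", "image": "example/krr:latest"}],
            }
        }
    },
}

patched = inherit_scheduling(runner_pod_spec, krr_job)
```

With something like this, the KRR job would land on the same dedicated nodes as the runner without any extra configuration, while explicit options could still override the inherited values.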
Describe alternatives you've considered There are none. I also didn't find a way to stop the KRR pod from running at all.