Open moro-drake opened 4 days ago
This is an interesting issue: we appear to use Exec everywhere we call git, which in turn uses cmd.CombinedOutput(), and that makes sure to call .Wait() (a missing .Wait() is normally the reason for zombie processes).
This needs a more thorough investigation.
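The reasoning above can be checked with a minimal Go sketch. It assumes a Unix-like system with echo and true on the PATH. The helpers runAndReap and startWithoutWait are hypothetical names used for illustration only; they are not Kargo's Exec function. The sketch relies on documented os/exec behavior: CombinedOutput calls Run, which calls Start and then Wait, and ProcessState is only populated once the child has been waited on.

```go
package main

import (
	"fmt"
	"os/exec"
)

// runAndReap is a hypothetical helper, not Kargo's actual Exec.
// CombinedOutput runs the command and waits for it, so the child has
// already been reaped when it returns. ProcessState is non-nil only
// after Wait (or Run) has completed.
func runAndReap(name string, args ...string) ([]byte, bool, error) {
	cmd := exec.Command(name, args...)
	out, err := cmd.CombinedOutput()
	return out, cmd.ProcessState != nil, err
}

// startWithoutWait is a hypothetical helper showing the pattern that
// normally produces zombies: Start alone does not reap the child.
// The caller must call Wait on the returned command.
func startWithoutWait(name string, args ...string) (*exec.Cmd, error) {
	cmd := exec.Command(name, args...)
	if err := cmd.Start(); err != nil {
		return nil, err
	}
	return cmd, nil
}

func main() {
	out, reaped, err := runAndReap("echo", "hello")
	fmt.Printf("output=%q reaped=%v err=%v\n", out, reaped, err)

	cmd, err := startWithoutWait("true")
	if err != nil {
		fmt.Println("start failed:", err)
		return
	}
	// Once the child exits, it stays in the process table as <defunct>
	// until the parent calls Wait.
	fmt.Println("reaped before Wait:", cmd.ProcessState != nil)
	if err := cmd.Wait(); err != nil {
		fmt.Println("wait failed:", err)
	}
	fmt.Println("reaped after Wait:", cmd.ProcessState != nil)
}
```

A child launched with Start stays in the process table as a defunct entry after it exits until its parent calls Wait, which is why a code path that always goes through CombinedOutput should not leave its direct child behind.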
Description
We run Kargo in an OpenShift cluster. OpenShift runs it as the following user:
runAsUser: 1001910000
(ps output for this user is attached as screenshots). Since updating to 1.0.3 we have noticed that it creates zombie [git] processes. These processes slowly build up and make the controller unusable (fork can no longer create new processes).
A Warehouse set to discover new tags using the NewestTag strategy speeds this up (you can see a zombie spawn on each refresh of the Warehouse in the UI). With a 1m interval and a discovery limit of 20, kargo-controller was dead within half a day (we had about 44 active Warehouses with a NewestTag subscription).
Screenshots
Steps to Reproduce
ps auxf | grep 'defunct' | wc -l
Refresh a Warehouse that uses the NewestTag strategy in the UI
ps auxf | grep 'defunct' | wc -l
- the count will have incremented, as a new zombie has spawned.
Version
Kargo v1.0.3
Logs