Open jdonnelly-apixio opened 1 year ago
@jdonnelly-apixio hey! We did not yet experience the issue, but this could lead to serious issues. At the moment "only" 4.5k configmaps on our end. Have you found how to clean this by the end of the job run?
@jdonnelly-apixio Do you have solved this problem? I think we have meet the same problem, our cluster has ~120k+ configmaps
I had the following error in the driver pod:
ERROR Utils: Uncaught exception in thread Thread-19 Message: Forbidden!Configured service account doesn't have access. Service account may have been revoked. configmaps is forbidden: User "system:serviceaccount:spark-ns:spark-operator" cannot list resource "configmaps" in API group "" in the namespace "spark-ns".
I have created a clusterrolebinding to give the edit role to the service operator (called spark-operator in my case).
kubectl create clusterrolebinding spark-role --clusterrole=edit --serviceaccount=spark-ns:spark-operator --namespace=spark-ns
I am facing a similar error just that instead of configmaps its not able create pods. Can this be a issue with permissions given to the spark role ?
+1 for this issue.
+1 here also
+1 here
any updates on how to resolve this issue?
ConfigMaps don't get cleaned up when sparkapplications are deleted. I think it might be good to include owner references for the configmaps that are created so cascading deletes can happen. I had ~160k in my cluster and it was causing timeouts listing configmaps near the end of spark jobs.