Closed: oindrillac closed this issue 2 years ago
The resource cap for this namespace can be updated via: https://github.com/operate-first/apps/blob/master/cluster-scope/base/core/namespaces/ds-ml-workflows-ws/resourcequota.yaml
The fix was to allocate more resources to the pod by increasing the resources in the Seldon deployment config.
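For anyone hitting the same thing, here is a minimal sketch of what "increasing the resources in the Seldon deployment config" can look like. It assumes the usual `SeldonDeployment` layout, where container resources live under `spec.predictors[].componentSpecs[].spec.containers[]`. The deployment name, container name, image, and the `2Gi`/`4Gi` values are hypothetical placeholders, not the values used in this issue.

```python
import copy
import json


def set_container_resources(manifest, container_name, requests, limits):
    """Return a copy of a SeldonDeployment manifest with new resources for one container."""
    updated = copy.deepcopy(manifest)
    found = False
    for predictor in updated["spec"]["predictors"]:
        for component in predictor.get("componentSpecs", []):
            for container in component["spec"]["containers"]:
                if container["name"] == container_name:
                    # Same shape as the Kubernetes container "resources" field.
                    container["resources"] = {
                        "requests": dict(requests),
                        "limits": dict(limits),
                    }
                    found = True
    if not found:
        raise ValueError(f"container {container_name!r} not found in manifest")
    return updated


# Hypothetical manifest: every name and value here is a placeholder.
manifest = {
    "apiVersion": "machinelearning.seldon.io/v1",
    "kind": "SeldonDeployment",
    "metadata": {"name": "example-model", "namespace": "ds-ml-workflows-ws"},
    "spec": {
        "predictors": [
            {
                "name": "default",
                "replicas": 1,
                "graph": {"name": "classifier", "type": "MODEL"},
                "componentSpecs": [
                    {
                        "spec": {
                            "containers": [
                                {"name": "classifier", "image": "example/image:latest"}
                            ]
                        }
                    }
                ],
            }
        ]
    },
}

# Assumed values; size them to the model actually being loaded.
updated = set_container_resources(
    manifest,
    "classifier",
    requests={"memory": "2Gi"},
    limits={"memory": "4Gi"},
)
print(json.dumps(updated, indent=2))
```

The printed JSON can be applied in place of the YAML manifest. Note that the container limits still have to fit inside the namespace quota linked above, otherwise the pod will not be admitted.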
Describe the Problem
Our Seldon deployment is failing at the model de-serialization step without any descriptive log.
The same deployment works when tested locally and on JupyterHub.
On the ds-ml-workflows-ws namespace, the pods error out and get OOMKilled after downloading the model.
I wonder if this is happening because of the memory limit on the namespace. Can we increase the limit?
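Since the logs are not descriptive, one way to confirm the OOM kill is to look at the container statuses in the pod object. The sketch below assumes the pod has been dumped as JSON (for example with `kubectl get pod <pod-name> -o json`) and checks the `state` and `lastState` fields for a terminated container with reason `OOMKilled`. The sample pod is a hypothetical, trimmed-down illustration, not output from this cluster.

```python
def oom_killed_containers(pod):
    """Return the names of containers whose current or last state is an OOM kill."""
    names = []
    for status in pod.get("status", {}).get("containerStatuses", []):
        for key in ("state", "lastState"):
            terminated = status.get(key, {}).get("terminated")
            if terminated and terminated.get("reason") == "OOMKilled":
                names.append(status["name"])
                break  # count each container once
    return names


# Hypothetical, trimmed-down pod object for illustration only.
sample_pod = {
    "metadata": {"name": "example-model-default-0", "namespace": "ds-ml-workflows-ws"},
    "status": {
        "containerStatuses": [
            {
                "name": "classifier",
                "state": {"waiting": {"reason": "CrashLoopBackOff"}},
                "lastState": {"terminated": {"reason": "OOMKilled", "exitCode": 137}},
            },
            {
                "name": "seldon-container-engine",
                "state": {"running": {}},
                "lastState": {},
            },
        ]
    },
}

print(oom_killed_containers(sample_pod))
```

If a container shows up here, the kill came from that container exceeding its own memory limit. A namespace quota that is too small usually shows up differently, as pods that fail to be created at all.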
cc: @chauhankaranraj @suppathak
Steps to Reproduce
Expected behaviour
Pod should have spun up successfully. Model deployment should return predictions as expected.
Screenshots
Additional context
related: https://github.com/open-services-group/community/issues/174