StackStorm / st2

StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 160 integration packs with 6000+ actions (see https://exchange.stackstorm.org) and ChatOps. Installer at https://docs.stackstorm.com/install/index.html
https://stackstorm.com/
Apache License 2.0
6.07k stars 749 forks source link

Stackstorm action runner does not return / complete - hangs and process is defunct #5300

Open camb1997 opened 3 years ago

camb1997 commented 3 years ago

When running an action that makes an internal HTTP call (from inside python with requests) that returns a lot of data. The called service is completing and returning HTTP payload over the network (we wiresharked). The action process (we think ?) (after about 15 seconds after the HTTP server returns its data) goes to "defunct" state. The parent process of the action process is the main action runner process. The core.http action works for same call ; we cannot use it because it will publish the output to the workflow and that is too much data to be in the workflow context due to Mongo db driver/performance issues.

st2 3.4.1, on Python 3.6.8 Running in Docker & Kubernetes OS is (cat/etc/os-release): NAME="CentOS Linux" VERSION="8" ID="centos" ID_LIKE="rhel fedora" VERSION_ID="8" PLATFORM_ID="platform:el8" PRETTY_NAME="CentOS Linux 8"

techdiverdown commented 3 years ago

We are having this same issue.

MichaelMcClure commented 3 years ago

Related issue fixed in ST2 3.5? https://github.com/StackStorm/st2/pull/5220

stale[bot] commented 2 years ago

Thanks for contributing to this issue. As it has been 90 days since the last activity, we are automatically marking is as stale. If this issue is not relevant or applicable anymore (problem has been fixed in a new version or similar), please close the issue or let us know so we can close it. On the contrary, if the issue is still relevant, there is nothing you need to do, but if you have any additional details or context which would help us when working on this issue, please include it as a comment to this issue.