Closed cccs-cat001 closed 3 years ago
Looks like the scheduler is getting a sigterm signal. Any hints in the events for the pod (you'll want a recent pod)?
kubectl get event --field-selector involvedObject.name={scheduler_pod_name}
BrokenPipeError happens after sys.exit(0), so you really need to trace down whats sigterming your scheduler constantly.
This issue has been automatically marked as stale because it has been open for 30 days with no response from the author. It will be closed in next 7 days if no further activity occurs from the issue author.
@jedcunningham I'm having the same issue with the scheduler. The scheduler has an event with the following error
`P:>kubectl get event --field-selector involvedObject.name=airflow-ml-dev-scheduler-6699c76bb7-xwrjb -n airflow-ml-dev LAST SEEN TYPE REASON OBJECT MESSAGE 6m44s Warning DNSConfigForming pod/airflow-ml-dev-scheduler-6699c76bb7-xwrjb Search Line limits were exceeded, some search paths have been omitted, the applied search line is: airflow-ml-dev.svc.cluster.local svc.cluster.local cluster.local pldc.kp.org crdc.kp.org ivdc.kp.org 103s Warning Unhealthy pod/airflow-ml-dev-scheduler-6699c76bb7-xwrjb (combined from similar events): Liveness probe failed: Could not find platform independent libraries
Could not find platform dependent libraries Consider setting $PYTHONHOME to [: ] Python path configuration: PYTHONHOME = (not set) PYTHONPATH = (not set) program name = 'python' isolated = 0 environment = 1 user site = 1 import site = 1 sys._base_executable = '/usr/local/bin/python' sys.base_prefix = '/tmp/build/80754af9/python_1599203911753/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeho' sys.base_exec_prefix = '/tmp/build/80754af9/python_1599203911753/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeho' sys.executable = '/usr/local/bin/python' sys.prefix = '/tmp/build/80754af9/python_1599203911753/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeho' sys.exec_prefix = '/tmp/build/80754af9/python_1599203911753/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeho' sys.path = [ '/tmp/build/80754af9/python_1599203911753/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeho/lib/python38.zip', '/tmp/build/80754af9/python_1599203911753/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeho/lib/python3.8', '/tmp/build/80754af9/python_1599203911753/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeho/lib/lib-dynload', ] Fatal Python error: init_fs_encoding: failed to get the Python codec of the filesystem encoding Python runtime state: core initialized ModuleNotFoundError: No module named 'encodings' Current thread 0x00007ff1b4e84740 (most recent call first): `
@saisujithkp the error suggests that your python installation is broken - likely your image has some problems or virtualenvs created by your deployment has been somehow messed up.
I suggest you use the Official helm chart https://airflow.apache.org/docs/helm-chart/stable/index.html and build the image using the official Dockerfile: https://airflow.apache.org/docs/docker-stack/build.html
More info about the "encodings" error is here: https://stackoverflow.com/questions/38132755/importerror-no-module-named-encodings
This issue has been automatically marked as stale because it has been open for 30 days with no response from the author. It will be closed in next 7 days if no further activity occurs from the issue author.
This issue has been closed because it has not received response from the issue author.
I had an identical issue with the scheduler using the Kubernetes Executor for airflow image (2.2.5-python3.8) deployed using Community Airflow Chart.
I solved the issue by turning off the taskCreationCheck
.
Apache Airflow version: 2.1.0
Kubernetes version (if you are using kubernetes) (use
kubectl version
):Environment:
uname -a
):What happened: Since I launched airflow 2.1.0 on our cluster on Friday, the scheduler has failed 716 times stating "BrokenPipeError"
What you expected to happen: For it to not do that.
How to reproduce it: I'm not too sure. Could it be an issue with Airflow 2.1.0 itself, and it can be reproduced just by launching it in a cluster? Using KubernetesExecutor, no celery. Could it be an issue with Azure?
Anything else we need to know: by my very rough calculations it happens every 6 minutes?