Closed TobiasGoerke closed 5 months ago
Assign @Linchin to fix this issue by adding RBAC permission to aggregate-to-kubeflow-pipelines-edit.
cc @gkcalat
I did face the same problem in Kubeflow 1.7.0. Thank you @zijianjoy for looking into it.
Hi @TobiasGoerke, thank you so much for reporting the issue! I found the place to add the permission, which seems to be associated with permissions given to service account in a namespace. Could you please help me and share a minimum pipeline that has the original issue, i.e., "pipeline steps being stuck / not being displayed as succeeded in the UI"? Thank you!
Hi @TobiasGoerke, thank you so much for reporting the issue! I found the place to add the permission, which seems to be associated with permissions given to service account in a namespace. Could you please help me and share a minimum pipeline that has the original issue, i.e., "pipeline steps being stuck / not being displayed as succeeded in the UI"? Thank you!
Glad to hear so!
Unfortunately, I'm not able to reproduce the stuck pipeline issue anymore. I recall the output file's content being stored to the pod annotation workflows.argoproj.io/outputs
, though..
The pipeline must have looked similar to this
# Attention: this example doesn't reproduce the issue
import kfp.dsl as dsl
from kfp.components import OutputPath, create_component_from_func
def example_step(test_path: OutputPath()):
import pickle
import numpy as np
arr = [np.array([1, 2, 3]), np.array([1, 2, 3]), np.array([1, 2, 3])]
with open(test_path, "wb") as f:
pickle.dump(arr, f)
print("Finished")
example_op = create_component_from_func(
example_step, base_image=BASE_IMAGE, packages_to_install=["numpy"]
)
@dsl.pipeline(name="Test Pipeline")
def pipeline():
example_op()
plain_pipeline_result = client.create_run_from_pipeline_func(
pipeline, arguments={}, experiment_name=EXPERIMENT_NAME, namespace=NAMESPACE
)
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Hi @TobiasGoerke, thank you so much for reporting the issue! I found the place to add the permission, which seems to be associated with permissions given to service account in a namespace. Could you please help me and share a minimum pipeline that has the original issue, i.e., "pipeline steps being stuck / not being displayed as succeeded in the UI"? Thank you!
I met the same question, and how to deal with it?
Hi @TobiasGoerke, thank you so much for reporting the issue! I found the place to add the permission, which seems to be associated with permissions given to service account in a namespace. Could you please help me and share a minimum pipeline that has the original issue, i.e., "pipeline steps being stuck / not being displayed as succeeded in the UI"? Thank you!
I met the same question, and how to deal with it?
Our users redesigned their pipelines and I lost track of the issue as stuck pipelines haven't resurfaced for us. If you're currently having this problem, I'm sure providing a reproducible example would lead to this issue getting fixed. Cheers!
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
This issue has been automatically closed because it has not had recent activity. Please comment "/reopen" to reopen it.
When executing any pipeline, the following lines are logged in the
wait
container:In addition to Argo considering this method to be legacy / insecure, we've had issue with pipeline steps being stuck / not being displayed as succeeded in the UI, depending on the files that were written to the OutputPath, as the resulting pod's annotation was malformed.
Fixing this RBAC issue is easily done by adding the resource
workflowtaskresults
to the aggregate-to-kubeflow-pipelines-edit ClusterRole.However, this will cause Argo Workflow to no longer write the annotation
workflows.argoproj.io/outputs
to the pipeline's pods, which several Kubeflow components rely on, e.g.: