numaproj / numaflow

Kubernetes-native platform to run massively parallel data/streaming jobs
https://numaflow.numaproj.io/
Apache License 2.0

Pipeline never paused #2057

Open juliev0 opened 2 weeks ago

juliev0 commented 2 weeks ago

Describe the bug

The Numaplane e2e test passes most of the time, but I want it to pass 100% of the time. This issue describes one case where it fails.

This is the sequence of steps the test had executed up to the point where the Pipeline never got paused:

Pipeline spec:

pipelineSpec = numaflowv1.PipelineSpec{
    InterStepBufferServiceName: isbServiceRolloutName,
    Vertices: []numaflowv1.AbstractVertex{
        {
            // Source vertex: built-in generator.
            Name: "in",
            Source: &numaflowv1.Source{
                Generator: &numaflowv1.GeneratorSource{
                    RPU:      &pipelineSpecSourceRPU,
                    Duration: &pipelineSpecSourceDuration,
                },
            },
            // Min and Max both point at numVertices, so the replica count is fixed.
            Scale: numaflowv1.Scale{Min: &numVertices, Max: &numVertices, ZeroReplicaSleepSeconds: &zeroReplicaSleepSec},
        },
        {
            // Sink vertex: built-in log sink.
            Name: "out",
            Sink: &numaflowv1.Sink{
                AbstractSink: numaflowv1.AbstractSink{
                    Log: &numaflowv1.Log{},
                },
            },
            Scale: numaflowv1.Scale{Min: &numVertices, Max: &numVertices, ZeroReplicaSleepSeconds: &zeroReplicaSleepSec},
        },
    },
    // Single edge: in -> out.
    Edges: []numaflowv1.Edge{
        {
            From: "in",
            To:   "out",
        },
    },
}

This zip file contains the following:

This is the full numaplane log.



juliev0 commented 1 week ago

Looks like GitHub closed this prematurely when you'd only written "partially fixes" in the PR, so reopening.