openshift-knative / serverless-operator

Main source repository for Openshift Serverless
Apache License 2.0
59 stars 69 forks source link

Test long running requests in Serving #3038

Closed skonto closed 1 week ago

skonto commented 1 week ago

Fixes JIRA #

Proposed Changes

skonto commented 1 week ago

/test ?

openshift-ci[bot] commented 1 week ago

@skonto: The following commands are available to trigger required jobs:

The following commands are available to trigger optional jobs:

Use /test all to run the following jobs that were automatically triggered:

In response to [this](https://github.com/openshift-knative/serverless-operator/pull/3038#issuecomment-2488180469): >/test ? Instructions for interacting with me using PR comments are available [here](https://git.k8s.io/community/contributors/guide/pull-requests.md). If you have questions or suggestions related to my behavior, please file an issue against the [kubernetes-sigs/prow](https://github.com/kubernetes-sigs/prow/issues/new?title=Prow%20issue:) repository.
skonto commented 1 week ago

/test 417-mesh-e2e-aws-417

skonto commented 1 week ago
11:44:26.122 SUCCESS: 🌟 Tests have passed 🌟
ingresscontroller.operator.openshift.io/default patched
knativeserving.operator.knative.dev/knative-serving patched
Running go test with args: -race -count=1 -tags=e2e -failfast -timeout=60m -parallel=1 ./test/servinge2e/servicemesh/longrunning --kubeconfigs /tmp/kubeconfig-1609998190,/go/src/github.com/openshift-knative/serverless-operator/user1.kubeconfig,/go/src/github.com/openshift-knative/serverless-operator/user2.kubeconfig,/go/src/github.com/openshift-knative/serverless-operator/user3.kubeconfig --imagetemplate {{- with .Name }}
{{- if eq . "httpproxy" }}quay.io/openshift-knative/serving/{{.}}:v1.15
{{- else if eq . "autoscale" }}quay.io/openshift-knative/serving/{{.}}:v1.15
{{- else if eq . "recordevents" }}quay.io/openshift-knative/eventing/{{.}}:v1.15
{{- else if eq . "wathola-forwarder" }}quay.io/openshift-knative/eventing/{{.}}:v1.15
{{- else if eq . "kafka" }}quay.io/strimzi/kafka:latest-kafka-3.4.0
{{- else }}quay.io/openshift-knative/{{.}}:multiarch{{end -}}
{{end -}}
PASS test/servinge2e/servicemesh/longrunning.TestTimeoutForLongRunningRequests (640.28s)
PASS test/servinge2e/servicemesh/longrunning
skonto commented 1 week ago

upstream mesh tests fail:

spoof.go:181: Retrying https://readiness-alternate-port-icngkohe-serving-tests.apps.serverless-ocp-4-17-amd64-aws-us-east-1-5nwb2.serverless.devcluster.openshift.com: retrying for certificate signed by unknown authority: Get "https://readiness-alternate-port-icngkohe-serving-tests.apps.serverless-ocp-4-17-amd64-aws-us-east-1-5nwb2.serverless.devcluster.openshift.com": tls: failed to verify certificate: x509: certificate signed by unknown authority

skonto commented 1 week ago

/retest

skonto commented 1 week ago

/test 417-test-upgrade-aws-417

skonto commented 1 week ago

Mesh tests passed, including the test in this PR. If upgrade tests don't pass I will updated the PR.

skonto commented 1 week ago

Ready to be merged.

skonto commented 1 week ago

@matzew hi, could you stamp this one?

mgencur commented 1 week ago

Hey @skonto , I didn't think this test was supposed to be merged since it takes a long time. Would it be possible to instead decrease all the values? This would have the same effect. IMO:

Aside from that, should we use autoscaling.TargetBurstCapacityKey: "-1", for the service to make sure Activator is on the path?

skonto commented 1 week ago

@mgencur It is only in the mesh tests which we run on demand (put it there intentionally). The idea was to mimic what people asked in the ticket, a timeout beyond the default (600).

Aside from that, should we use autoscaling.TargetBurstCapacityKey: "-1", for the service to make sure Activator is on the path?

With default values in the ksvc (like in this case) and when we scale from zero activator is always on the path. We would only need that if we do load testing.

mgencur commented 1 week ago

We also run this periodically, like here: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/periodic-ci-openshift-knative-serverless-operator-main-413-mesh-e2e-aws-413-c/1859764914193698816 Anyway, it's not a big concern. I suppose it's fine for the lack of time to have the long test now. Maybe in the future shorter values could be used because:

mgencur commented 1 week ago

/lgtm

openshift-ci[bot] commented 1 week ago

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: mgencur, skonto

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files: - ~~[OWNERS](https://github.com/openshift-knative/serverless-operator/blob/main/OWNERS)~~ [mgencur,skonto] Approvers can indicate their approval by writing `/approve` in a comment Approvers can cancel approval by writing `/approve cancel` in a comment
skonto commented 1 week ago

@mgencur ok let's revise this for 1.36 :)