Open oakrizan opened 2 months ago
Pinging @elastic/security-service-integrations (Team:Security-Service Integrations)
During the migration of beats-ci from Jenkins to Buildkite, a number of tests were failing consistently due to issues unrelated to the migration. Those tests were disabled to stabilize the CI, with the intent to revisit them post-migration. @oakrizan has reviewed them all in her draft PRs linked above in the description, and has opened tickets such as this one to highlight to the product teams the tests that are currently still disabled and could use some attention.
I have potentially fixed problem with CURL by updating the version of observability/stream
from v0.6.1
to v0.7.0
in testing/environments/docker/cometd/Dockerfile
. But now AWS test step fails with AuthorizationError: https://buildkite.com/elastic/beats-xpack-filebeat/builds/5127#01915b0f-c5ba-41ba-9c70-453222f11cf3.
Will look for the solution for that.
In addition removed retry in initCloudEnv.sh since there is no log in case of failure of terraformApply
.
Related PR: https://github.com/elastic/beats/pull/40549
Flaky Test
Test Name: github.com/elastic/beats/v7/x-pack/filebeat/input/cel
Link: Link to file/line number in github.
Branch: main
Artifact Link: If available, attach the generated zip artifact associated with the stack trace for this failure.
Notes: Additional details about the test. e.g. theory as to failure cause In Jenkins AWS tests for
x-pack/filebeat
were disabled on main/8.*:stage
cloud
was disabled within the https://github.com/elastic/beats/pull/23186stage
cloudAWS
(which was introduced as replacement forcloud
stage in https://github.com/elastic/beats/pull/29047) was commented out due to flakiness within the https://github.com/elastic/beats/pull/36558On 7.17 branch AWS tests were enabled for specific changeset and were successful (eg. https://github.com/elastic/beats/pull/35885). After Beats migration from Jenkins to Buildkite, AWS tests were temporarily re-enabled for
x-pack/filebeat
on 8.*/main branches for validation purposes.AWS test failure context: https://github.com/elastic/beats/issues/36425
github.com/elastic/beats/v7/x-pack/filebeat/input/cel
was still occasionally failing with same error. Once retried the test was successful. Though the build failed while was runningcometd
container forlocalstack
integration tests.There was a similar issue opened when running test on Windows: https://github.com/elastic/beats/issues/39657. It was fixed by increasing timeout from 5 to 10 (https://github.com/elastic/beats/pull/39713). Apparently it's sometimes not enough when running tests on AWS.
I have created a https://github.com/elastic/beats/pull/40162, where timeout is 20 and tests seems to be successful when executed on AWS: https://buildkite.com/elastic/beats-xpack-filebeat/builds/4118. Basically this problem can be bypassed either by enabling retry for AWS step, either by increasing timeout again.
beats-xpack-filebeat_build_3351_ubuntu-x-pack-slash-filebeat-aws-tests.log beats-xpack-filebeat_build_3351_ubuntu-x-pack-slash-filebeat-aws-tests-retry.log
7.17 branch Go Integration tests fail for
x-pack/filebeat/input/awss3
which seems to be incorrect/missing bucket policy related issue, since tests fail withAccess Denied status code: 403
. beats-xpack-filebeat_build_3348_ubuntu-x-pack-slash-filebeat-aws-tests.logIssue reproducing on CI
Add
aws
label to beats-related PRCommit changes for x-pack/filebeat & build will be triggered in Buildkite
[ ] Enable AWS tests on main/8.* once fixed
[ ] Enable AWS tests on 7.17 once fixed
Stack Trace