Closed bobheadxi closed 3 years ago
@uwedeportivo assigning you for now since you might have some context given it seems you wrote the handbook page on it (https://github.com/sourcegraph/about/pull/534), but let me know if that's not the case!
But we should evaluate a CI solution that doesn't require spinning up a GKE cluster to test. Our CI is a bit brittle on deploy-sourcegraph
right now.
https://github.com/sourcegraph/deploy-sourcegraph/pull/1067 builds (e.g. https://buildkite.com/sourcegraph/deploy-sourcegraph/builds/4557) and other deploy-sourcegraph builds were failing for a variety of reasons:
Then, a few days later:
Since the latter error (which occurs early in the test) replaced the former, I didn't investigate further into the first error, but the second error was traced down to a version change in the test (due to versions not being pinned) to Kubernetes 1.16+, which I was unable to resolve, possibly related to:
A workaround to unblock the 3.21 release was introduced in https://github.com/sourcegraph/deploy-sourcegraph/pull/1068 by simply pinning the kubernetes version to the version used in the last passing build, 1.15.12-gke.20
This is probably not a long-term solution since we will likely need to upgrade eventually, so follow-up items include:
test.sh
files in one place, this handbook page: https://about.sourcegraph.com/handbook/engineering/distribution/k8s_admin_custom_policy - cc @uwedeportivo , seems like you created this in https://github.com/sourcegraph/about/pull/534?Stretch goals might include running the restricted test across multiple versions the same way the "fresh" test does via Pulumi as well