x/vuln: false positive call stack for GO-2023-2402 in Kubernetes

aojea commented 7 months ago

$ govulncheck ./...
Scanning your code and 2016 packages across 204 dependent modules for known vulnerabilities...

Vulnerability #1: GO-2023-2402
    Man-in-the-middle attacker can compromise integrity of secure channel in
    golang.org/x/crypto
  More info: https://pkg.go.dev/vuln/GO-2023-2402
  Module: golang.org/x/crypto
    Found in: golang.org/x/crypto@v0.14.0
    Fixed in: golang.org/x/crypto@v0.17.0
    Example traces found:
      #1: test/e2e/framework/ssh/ssh.go:316:33: ssh.runSSHCommandViaBastion calls ssh.Client.Dial
      #2: test/e2e/framework/ssh/ssh.go:329:35: ssh.runSSHCommandViaBastion calls ssh.Client.NewSession
      #3: test/e2e/framework/ssh/ssh.go:301:32: ssh.runSSHCommandViaBastion calls ssh.Dial
      #4: test/e2e/framework/ssh/ssh.go:326:25: ssh.runSSHCommandViaBastion calls ssh.NewClient
      #5: test/e2e/framework/ssh/ssh.go:322:44: ssh.runSSHCommandViaBastion calls ssh.NewClientConn
      #6: test/e2e/framework/ssh/ssh.go:333:2: ssh.runSSHCommandViaBastion calls ssh.Session.Close
      #7: test/e2e/framework/ssh/ssh.go:339:22: ssh.runSSHCommandViaBastion calls ssh.Session.Run
      #8: test/e2e/framework/ssh/ssh.go:320:2: ssh.runSSHCommandViaBastion calls ssh.channel.Close
      #9: test/utils/harness/harness.go:54:14: harness.Harness.Close calls ssh.stdin, which calls ssh.channel.CloseWrite
      #10: cmd/kubeadm/app/preflight/checks.go:339:18: preflight.FileContentCheck.Check calls io.Copy, which eventually calls ssh.channel.Read
      #11: cmd/kubeadm/app/preflight/checks.go:544:13: preflight.SystemVerificationCheck.Check calls bufio.Writer.Flush, which calls ssh.channel.Write
      #12: cmd/kubeadm/app/preflight/checks.go:339:18: preflight.FileContentCheck.Check calls io.Copy, which eventually calls ssh.extChannel.Read

Your code is affected by 1 vulnerability from 1 module.

Share feedback at https://go.dev/s/govulncheck-feedback.

There are doubts about the next three traces found

calls io.Copy, which eventually calls ssh.channel.Read calls bufio.Writer.Flush, which calls ssh.channel.Write calls io.Copy, which eventually calls ssh.extChannel.Read

@neolit123 @dims

xref: https://github.com/kubernetes/kubernetes/issues/122424

thanm commented 7 months ago

I will let the vulncheck experts weigh in here, but from the peanut gallery this seems like a natural consequence of the way Go interfaces work.

This is a static analysis tool; if io.Copy invokes a method on an io.Reader interface object, the checker in most cases won't know the concrete type of the object underlying the interface. This means it will have to conservatively assume that the underlying object could be any type that satisfies the io.Reader interface. The type ssh.channel.Read is live (not deadcoded) in your binary, so the assumption has to be that it might be invoked here.

thanm commented 7 months ago

@golang/vulndb per owners

dims commented 7 months ago

@thanm makes life harder for maintainers :(

rittneje commented 7 months ago

@thanm In this particular case, I'd really expect the static analyzer to work out the concrete types, since it is already looking at particular traces. Namely, it ought to determine the io.Reader is an io.LimitedReader wrapping an os.File, since they are all local variables declared right beforehand. I'm sure that in general there are cases where determining the concrete type is prohibitively difficult, but vulncheck needs to minimize false positives in order to be useful.

In addition, if vulncheck is unable to determine the concrete type, it should emit something clearer in the log message. For example: "calls io.Copy, which eventually calls io.Reader.Read, which might resolve to ssh.channel.Read"

neolit123 commented 7 months ago

In addition, if vulncheck is unable to determine the concrete type, it should emit something clearer in the log message. For example: "calls io.Copy, which eventually calls io.Reader.Read, which might resolve to ssh.channel.Read"

for common stdlib interfaces like io.Reader that would still be an incredibly wild guess by the tool.

zpavlinovic commented 7 months ago

This is unfortunately a current limitation in the tool. Knowing the exact concrete types for an interface in general is an undecidable problem. Hence govulncheck has to approximate and it chooses to over-approximate.

It computes an over-approximate call graph and then it finds a call stack that leads to a vulnerability. There are typically many such call stacks and some of them might not exist in practice due to the over-approximate nature of the call graph. It is impossible to validate each call stack so we employ some heuristics with the goal of picking one that is least likely to be a false positive. We could probably do a better job: ssh.channel is passed to io.Copy somewhere else, but I don't think it is even mentioned in the call stack here. I will look into this.

thediveo commented 7 months ago

how to deal with such false positives? will there be a block list, preferably based on the call path and not just CVE?

Jorropo commented 7 months ago

@thediveo go get golang.org/x/crypto@v0.17.0 or @latest :sparkles:

You could have a situation where the only security fixes happen in versions where there are incompatible breaking changes. But in my experience this is exceedingly rare in the Go community. Except in cases where the original design was wrong like in GO-2022-0646.

Jorropo commented 7 months ago

If we really want a block list, I think it should be explicit like govulncheck --ignore=path/to/block/list.txt.

timothy-king commented 7 months ago

It is fairly clear that the reported path are not real. That should definitely be improved, but it does not appear to be the root cause.

From what I can tell, it appears that the core of the issue is that the analysis cannot conclude that io.Copy is not called with a Reader and a Writer from ssh like golang.org/x/crypto/ssh.channel. For example, ssh.Session.stdout() calls io.Copy(s.Stdout, s.ch). So to conclude the program is not vulnerable requires deducing that ssh.Session.stdout() (etc.) is unreachable within the program. These are themselves reachable from functions in the package k8s.io/kubernetes/test/e2e/framework/ssh. So deducing that the vulnerability is unreachable also requires finding functions in this package are unreachable from any main package.

Using go list -f '{{.Name}}: {{ .ImportPath }}: {{ .Deps }}' ./... we can conclude both the main packages k8s.io/kubernetes/test/e2e/network/scale/localrun and k8s.io/kubernetes/test/soak/serve_hostnames depend on the package k8s.io/kubernetes/test/e2e/framework/ssh. This package is in turn used in quite a few places (grep -R 'k8s.io/kubernetes/test/e2e/framework/ssh') . I have not yet found an obvious smoking gun for either of these. So my hunch is that functions like k8s.io/kubernetes/test/e2e/framework/ssh.SSH are more likely to be unreachable. Getting to the bottom of why govulncheck cannot conclude they are unreachable in these two packages will require a deep dive into its internals though.

As a practical point, partitioning large modules can a good way to improve precision. This can remove false positives due to aliasing. (If golang.org/x/crypto/ssh is not depended on, ssh.Session.stdout() will not be imported, will not create the alias, and will not that bogus trace.) So +1 to partitioning like https://github.com/kubernetes/kubernetes/issues/122424#issuecomment-1866537540 .

how to deal with such false positives?

At the moment this is up to each maintainer to triage and appropriately decide to update, ignore, or build a custom suppression mechanism.

will there be a block list, preferably based on the call path and not just CVE?

Keep in mind that the number of paths of this form are potentially exponential in the number of functions in the program. Even if govulncheck supported this, it should probably quickly give up trying to find a path and would report instead that 'Vuln #XYZ may be reachable, but we could not find a feasible path.' It is not clear this would put us in an appreciably different situation.

Edges are only quadratic in the # of functions in the program. For example, asserting that the Read in io.copyBuffer does not call golang.org/x/crypto/ssh.extChannel.Read. We could also have assertion that some functions are unreachable. (Pro: linear. Con: Requires the user to think about aliasing.) Also user assertions that are correct today can become wrong over time (and a file encourages them to be checked in) so if supported this should be done with care. There is no plan implement this at the moment, but I would encourage you to file a new issue with a feature request that includes what types of assertions you are hoping for if you are interested.

I'd really expect the static analyzer to work out the concrete types, since it is already looking at particular traces. Namely, it ought to determine the io.Reader is an io.LimitedReader wrapping an os.File, since they are all local variables declared right beforehand.

If you are familiar with trace based static analysis (like symbolic execution), that may mislead you to how govulncheck works under the hood. It is doing callgraph based reachability at the moment. However, the intuition that it should be able to statically determine from the callsite that certain variables within the callee have a fixed type is a good one and leads to what I suspect will be the long term fix for this case. In essence, if one was to inline both io.Copy and io.copyBuffer that is enough precision to devirtualize these calls, which would remove these spurious traces. Unfortunately doing depth 2 calls like this naively is like making the program O(n^3) in size in order to analyze it more precisely. This kind of 'inlining' is doable IMO, but will require tuning some heuristics in order to not blow up. This will take some investigation (and probably a Go proposal).

golang / go

x/vuln: false positive call stack for GO-2023-2402 in Kubernetes #64834