rficcaglia commented 5 years ago

I recommend deprecating this issue and referring interested readers to a discussion on the Policy WG call documented here:

https://docs.google.com/document/d/1ihFfEfgViKlUMbY2NKxaJzBkgHh-Phk5hqKTzK-NEEs/edit#

DEPRECATED:

During the K8s Policy WG session today (6/5/2019) we discussed whether and how policies might be formally verified. We talked about using Datalog (pros and lots of cons) for formal verification, and insofar as OPA's Rego is similar to Datalog (though not strictly Datalog), it might be possible. Maybe. In any case, I volunteered to write up a strawman outline of what this might look like, in very hand-wavy terms, to get the discussion started. @hannibalhuang asked me to put it here in sig-security. @patrick-east and @ericavonb were also interested in reviewing, I believe. Enjoy...

Formal Verification of Policy In Kubernetes

Human writes a policy
- it might be a policy to grant or deny user access to a resource in a multi-tenant cluster,
- or it might be a policy that requires certain syscall activity to be monitored on some pods with certain labels,
- or it might be a policy that says network traffic that is regulated by PCI or HIPAA should be read-only to some microservices but writeable by others,
- or it might be a policy that specifies some alert action to be triggered when a given audit event occurs.
The policy is essentially a specification of what the expected behavior of the system should be, i.e. System + Policy => Safety Properties (nothing bad happens).
A tool checks the "validity" of the policy and that the system execution matches the specification.
- it produces a verification "proof" (model) if the policy is correct, or generates a counterexample if the policy is not correct
Verification is completely automatic
- Human can say with confidence, "this policy correctly implements the behavior I intended"
- Software can use the proof/model and reason with it.

How is Verification Done?

Conditions are expressed in logic (e.g. Rego rules).
Given a set of logic rules, P, check whether there exists a proof/model of P.
Given a model m (the verification conditions), use a solver to try to solve them.
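
To make the steps above concrete, here is a minimal sketch in Python that stands in for a real solver by enumerating every assignment of a few boolean facts. The rule set, the variable names (`is_admin`, `in_tenant`, `allow`), and the `find_model` helper are all hypothetical and exist only for illustration; this is not Rego or Datalog, and a real tool would use a solver rather than brute force.

```python
from itertools import product

# Hypothetical boolean facts; the names are illustrative only.
VARIABLES = ["is_admin", "in_tenant", "allow"]

# Hypothetical rule set P, written as predicates over an assignment m.
RULES = [
    # "allow" holds exactly when the user is an admin or a tenant member.
    lambda m: m["allow"] == (m["is_admin"] or m["in_tenant"]),
    # In this toy system, admins are never tenant members.
    lambda m: not (m["is_admin"] and m["in_tenant"]),
]

def find_model(rules, variables, extra=()):
    """Return one assignment satisfying every rule, or None if none exists."""
    all_rules = list(rules) + list(extra)
    for values in product([False, True], repeat=len(variables)):
        model = dict(zip(variables, values))
        if all(rule(model) for rule in all_rules):
            return model
    return None

# Step 1: P is consistent if at least one model exists.
model = find_model(RULES, VARIABLES)

# Step 2: to check a safety property, search for a model of P together with
# the NEGATED property. Finding one yields a counterexample; finding none
# means the property holds over this (tiny, finite) space.
def violation(m):
    return m["allow"] and not m["is_admin"] and not m["in_tenant"]

counterexample = find_model(RULES, VARIABLES, extra=[violation])
```

The "negate the property and look for a model" step is the usual way a satisfiability check is turned into a verification check; whether that maps cleanly onto Rego is exactly the open question in this issue.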

justincormack commented 5 years ago

Sorry, this is somewhat unclear. If I write a policy, who is writing the specification it is checked against? I think the summary needs spelling out in more detail.

There are actually interesting things you can check on a single policy, but they are not correctness. E.g. AWS has done some work on finding parts of a policy that are "dead code", as they probably indicate mistakes, and on finding policies that map to "probably unintended things", e.g. world-writable S3 buckets (potentially a safety property).
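
As a rough illustration of the "dead code" check, the Python sketch below enumerates a small finite request space and reports rules that never fire. The first-match policy format, the rule names, and the user/action domains are assumptions made up for this example; it says nothing about how AWS actually implements its analysis.

```python
from itertools import product

# Hypothetical first-match policy: (rule name, condition, decision).
RULES = [
    ("deny-anonymous", lambda r: r["user"] == "anonymous", "deny"),
    ("allow-read", lambda r: r["action"] == "read", "allow"),
    # Shadowed by "deny-anonymous": it can never be the first match.
    ("allow-anon-read",
     lambda r: r["user"] == "anonymous" and r["action"] == "read", "allow"),
    ("deny-rest", lambda r: True, "deny"),
]

# Hypothetical finite input domains.
USERS = ["anonymous", "alice"]
ACTIONS = ["read", "write"]

def first_match(request):
    """Return (rule name, decision) for the first rule whose condition holds."""
    for name, condition, decision in RULES:
        if condition(request):
            return name, decision
    return None, "deny"

def dead_rules():
    """Return the names of rules that never fire for any request in the domain."""
    fired = set()
    for user, action in product(USERS, ACTIONS):
        name, _ = first_match({"user": user, "action": action})
        fired.add(name)
    return [name for name, _, _ in RULES if name not in fired]
```

Here `dead_rules()` reports `allow-anon-read`, which likely signals a mistake: someone intended anonymous reads to be allowed, but an earlier rule already denies them.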

rficcaglia commented 5 years ago

Yes, we discussed that on the call, and I asked much the same question. I assumed there must be a symbolic specification too, but I think the idea was that the policy itself is the "specification" and the system behavior is the "program" to verify. I may have misunderstood what AWS was saying, though. To your point, what AWS used as an example was things like privilege escalation in IAM, so more like consistency checks in the policy. So the use cases discussed here may be a bridge too far.


justincormack commented 5 years ago

I wasn't on the call but just caught up. I think the system behaviour is different though. I think writing up some quite detailed concrete examples would help us understand what we think we want to do and what is feasible.

rficcaglia commented 5 years ago

One detail I may have glossed over: on the call Howard presented slides that talked about a "policy symbolic graph": https://docs.google.com/presentation/d/1DSxm7IqVnweJbsGSGyhrhB1daR1E0hj4Ko-D4mvyxFk/edit#slide=id.g585a5014dd_0_17

Though we didn't discuss that in any detail, that might be the symbolic specification that he had in mind for checking the policy?


copumpkin commented 5 years ago

(I was the AWS person on the call 👋)

Thanks for writing this up @rficcaglia! Regarding @justincormack's question and this bullet:

A tool checks "validity" of the policy and that the system execution matches the specification. produces verification "proof" (model) if the policy is correct

I wouldn't phrase it that way. The way I like to think about it is that we have three things going on in a system:

1. The high-level human intent behind the policy
2. The policies they wrote (ostensibly) to achieve that intent
3. The code that evaluates those policies

While it's possible, with sufficient effort, to formally verify that (3) is implemented properly (which I think is what you're getting at in the bullet I quoted above), what I was suggesting was the "low-hanging fruit" of validating (1) against (2).

Here's an example: say you have policy in many parts of a system to implement the desired access structure for a business. Although policy is more succinct than arbitrary code, it can still be gnarly for humans to understand, and tends to accrete weird special cases over time. But we also often have simple high-level invariants we'd like to hold over our policy: asserting that nobody can escalate privileges except by permitted auditable mechanisms, or asserting things about all principals with the power to affect the availability of the system. You can think of these almost like logical "tests" of your policy: if we didn't have automated reasoning, we might craft a handful of cases we want to hold true, like "Joe in accounting can't reconfigure my production infrastructure" or "Jane the SRE needs access to production logs"; but these logical tools allow us to write "universal tests" that can test conceptually infinite situations in finite time, so instead of concretizing our tests, we now say things like "nobody outside of the SRE group can access production logs" and ask the system if that's true. Narrow but well defined invariants like those take us a step closer to validating (1) against (2), even though (1) is often very fuzzy and informal and impossible to fully specify in general.
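
A small Python sketch of the "universal test" idea follows. The users, groups, resources, and the `allowed` function are all invented for illustration. It also checks the invariant by exhaustively enumerating a finite domain, which is only a stand-in: the automated reasoning tools described above work symbolically and do not need the domain to be small or even finite.

```python
from itertools import product

# Hypothetical principals and group memberships.
GROUPS = {
    "joe": {"accounting"},
    "jane": {"sre"},
    "sam": {"sre", "accounting"},
    "eve": set(),
}
RESOURCES = ["prod-logs", "prod-infra", "ledger"]
ACTIONS = ["read", "write"]

def allowed(user, action, resource):
    """Hypothetical policy under test."""
    groups = GROUPS[user]
    if resource == "prod-logs":
        return action == "read" and "sre" in groups
    if resource == "prod-infra":
        return "sre" in groups
    if resource == "ledger":
        return "accounting" in groups
    return False

def check_invariant(invariant):
    """Return the first request that violates the invariant, or None."""
    for user, action, resource in product(GROUPS, ACTIONS, RESOURCES):
        decision = allowed(user, action, resource)
        if not invariant(user, action, resource, decision):
            return (user, action, resource)
    return None

# "Nobody outside of the SRE group can access production logs."
def only_sre_access_logs(user, action, resource, decision):
    if resource == "prod-logs" and "sre" not in GROUPS[user]:
        return not decision
    return True

# "Nobody in accounting can reconfigure production infrastructure."
def accounting_cannot_write_infra(user, action, resource, decision):
    if (resource == "prod-infra" and action == "write"
            and "accounting" in GROUPS[user]):
        return not decision
    return True
```

The first invariant holds for every request. The second does not: `check_invariant(accounting_cannot_write_infra)` returns a counterexample involving `sam`, who belongs to both groups. That is the kind of accreted special case a concrete "Joe in accounting" test would miss.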

So back to @justincormack's question:

If I write a policy, who is writing the specification it is checked against?

I'd propose that k8s security experts write out a model of how e.g., privilege escalation might work in k8s. The average user, even though they care about privilege escalation, can't be expected to understand every dark corner of the API or transitive concerns arising from it. The model would include things like direct privilege escalation "within" the policy engine, but potentially also things like the power to reconfigure the policy engine itself or to otherwise affect the execution of policy. End users then get a simplified interface to make assertions over their policies in terms of those models constructed by the experts. The models could also be parametrized, giving users the power to assert things like "no principal can escalate privileges, except those in this group", and so on.
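
One way to picture this split between expert-written models and user assertions is the Python sketch below. Everything in it is a hypothetical simplification: the `ESCALATING` set is a tiny, incomplete stand-in for an expert model (it borrows a few RBAC verb names only as examples), and the policy and group data are made up. The `exempt_group` parameter plays the role of "except those in this group".

```python
# Hypothetical expert-written model: (verb, resource) pairs that the experts
# consider direct escalation paths. Deliberately tiny and incomplete.
ESCALATING = {
    ("create", "rolebindings"),
    ("bind", "roles"),
    ("escalate", "roles"),
    ("impersonate", "users"),
}

# Hypothetical user-supplied policy: principal -> granted (verb, resource) pairs.
POLICY = {
    "alice": {("get", "pods"), ("create", "rolebindings")},
    "bob": {("get", "pods")},
    "ci-bot": {("create", "deployments"), ("impersonate", "users")},
}

# Hypothetical group memberships.
GROUPS = {
    "alice": {"cluster-admins"},
    "bob": {"developers"},
    "ci-bot": set(),
}

def escalation_violations(policy, groups, exempt_group):
    """Assert: no principal can escalate privileges, except members of exempt_group.

    Returns a dict mapping each violating principal to its risky grants.
    """
    violations = {}
    for principal, grants in policy.items():
        if exempt_group in groups.get(principal, set()):
            continue  # explicitly permitted by the parameter
        risky = grants & ESCALATING
        if risky:
            violations[principal] = sorted(risky)
    return violations
```

With `exempt_group="cluster-admins"`, only `ci-bot` is flagged. The end user never has to know which verbs matter; that knowledge lives in the expert-maintained model.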

Of course, there are plenty of other designs that could work, but this is the direction I was proposing on the call.

timothyhinrichs commented 5 years ago

Agree that concrete models of "privilege escalation" or "multi-tenancy" are really important here, especially from a methodological perspective. Different technologies in the verification space help us do different kinds of verification. So step 1 is understanding what kind of verification we want to do and therefore what kind of technology we need. In the end what often happens is that we figure out what's possible to verify and, as @copumpkin says, parameterize it to the extent we can. From what I've seen, it usually ends up being that the value is less that a user can come along and write their OWN model/query; rather, it's that the user provides their OWN policy to be checked against a fixed (and possibly parameterized) model/query.

Examples. SMT solvers (isn't that what you're using @copumpkin?) are good at finding inputs to a policy that generate a particular class of policy decisions. Model checkers will find a sequence of inputs that result in a particular class of policy decisions. Theorem provers will guarantee a policy decision is always "true" over all possible inputs. (Here by 'input' think of OPA's input + external data.) Moreover, the devil's often in the details for which kind of technology you use: SMT solvers focus on particular kinds of built-in functions like arithmetic, regexps, and so on, whereas many theorem provers (last time I looked) often don't have any built-in support for those, requiring you to axiomatize everything you need.

The multi-tenancy verification problem sounds like theorem proving: for all inputs, everyone from group A can access namespace A's resources, but no other namespace's resources. Not sure what privilege escalation means, but it sounds like the answer will be multi-step, pointing to model-checking.
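
To show what "multi-step" means in the model-checking sense, here is a toy Python sketch that does a breadth-first search for a sequence of actions leading from a starting set of permissions to a target permission. The transition table and permission strings are hypothetical and are not an accurate model of Kubernetes; a real model checker would also handle far larger state spaces than this explicit search can.

```python
from collections import deque

# Hypothetical transitions: (action name, permission required, permission gained).
TRANSITIONS = [
    ("create-pod", "create:pods", "use:serviceaccount"),
    ("read-sa-token", "use:serviceaccount", "get:secrets"),
    ("use-admin-token", "get:secrets", "cluster-admin"),
]

def find_escalation(start, target, transitions):
    """Return a shortest list of actions that reaches `target`, or None.

    A state is the set of permissions a principal currently holds.
    """
    start = frozenset(start)
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        perms, trace = queue.popleft()
        if target in perms:
            return trace
        for name, needs, gains in transitions:
            if needs in perms and gains not in perms:
                next_perms = perms | {gains}
                if next_perms not in seen:
                    seen.add(next_perms)
                    queue.append((next_perms, trace + [name]))
    return None
```

Starting from `{"create:pods"}`, the search returns a three-step trace ending in `cluster-admin`; starting from `{"get:pods"}` it returns `None`. The trace is the useful part: it is the counterexample a human can read and act on.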

Alternatively, we could start by looking at a particular tool to understand what kind of verification it can do, try out some examples, and go from there. Z3 pops to mind as a good place to start--a bit of SMT and maybe theorem proving if I remember right.

rficcaglia commented 5 years ago

I was not on the Policy WG call, but I am noting the use case sketches discussed by @ericavonb:

- operator privilege escalation (e.g. can I create a resource that can escalate and create new resources)
- Gatekeeper cluster state cache and how actions are evaluated based on state change
- Tenant CRD certificate creation use cases

I would also add the other discussion on the call about mapping PSP to Gatekeeper to identify gaps:

- Identify gaps in PSPs vs. Gatekeeper Constraint templates

Also link to the Google Group thread: https://groups.google.com/d/topic/kubernetes-wg-policy/JLz1zZgi_vk/discussion

rficcaglia commented 5 years ago

Pending discussion and feedback today (7/24/19) on the Policy WG call, I'll submit a PR, which I would consider action enough to close this issue.

cncf / tag-security

Kubernetes Policy WG Discussion: Formal Verification #196
