nginxinc / nginx-gateway-fabric

NGINX Gateway Fabric provides an implementation for the Gateway API using NGINX as the data plane.
Apache License 2.0

Separate control and data planes #292

Closed f5yacobucci closed 10 months ago

f5yacobucci commented 1 year ago

To move quickly and build an understanding of the Gateway API, its implementation, and how it aligns with NGINX as a data plane, we decided on a simplified but rigid deployment pattern. To improve our security posture and installation flexibility, the control and data planes should be separated into semi-autonomous, distributed components.

Problem

- Control plane containers and data plane containers compose into a single Kubernetes Pod.
- Control plane containers use OS signals and file system sharing to exchange data with the data plane.
- Control plane and data plane are governed by the same RBAC policies, as they reside in one Pod and ServiceAccount.
- Control plane and data plane must scale together and cannot scale along independent axes.
- Compromise of the control plane may impact customer traffic in the data plane.
- Compromise of the data plane may expose the Kubernetes API server, impact the cluster, and allow lateral movement in the network.
- Kubernetes secrets and sensitive data will be shared across containers unnecessarily.
- Violation of a basic zero-trust tenet: "The data plane and control plane are logically separated." (NIST SP 800-207)
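To make the coupling concrete, here is a minimal Python sketch of the "shared file system plus OS signal" pattern described above. It is an illustration, not the project's code: the function names, the file paths, and the use of a PID file are assumptions. The one NGINX-specific fact it relies on is that the NGINX master process reloads its configuration on `SIGHUP`.

```python
import os
import signal
import tempfile
from pathlib import Path


def write_config_atomically(conf_path: Path, contents: str) -> None:
    """Write to a temp file in the same directory, then rename it into place,
    so a reader of conf_path never sees a half-written file."""
    fd, tmp_name = tempfile.mkstemp(dir=str(conf_path.parent), prefix=".tmp-")
    try:
        with os.fdopen(fd, "w") as tmp:
            tmp.write(contents)
        os.replace(tmp_name, conf_path)
    except BaseException:
        # The rename did not happen, so the temp file is still there.
        os.unlink(tmp_name)
        raise


def reload_data_plane(conf_path, pid_path, contents, send_signal=os.kill):
    """Hypothetical control plane step: publish config on the shared volume,
    then signal the data plane process to pick it up.

    send_signal is injectable so the flow can be exercised without a real
    NGINX process; by default it is os.kill.
    """
    write_config_atomically(Path(conf_path), contents)
    pid = int(Path(pid_path).read_text().strip())
    send_signal(pid, signal.SIGHUP)  # NGINX reloads its config on SIGHUP
    return pid
```

Both steps only work when the two processes can see the same files and the same process IDs, which is what ties the control plane and the data plane to a single Pod.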

Solution

Separate the control and data planes into independent Pods and Deployments.
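As a rough sketch of what that separation could look like, the Python below builds two independent Deployment manifests as plain dictionaries. All names, images, and replica counts are hypothetical placeholders. The communication channel between the planes is deliberately not shown, since the design document is meant to decide it. The manifest fields used (`replicas`, `serviceAccountName`, `automountServiceAccountToken`) are standard Kubernetes Deployment and Pod spec fields.

```python
import json


def deployment(name, image, replicas, service_account, mount_api_token):
    """Build a minimal apps/v1 Deployment manifest as a dict."""
    labels = {"app": name}
    return {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": name},
        "spec": {
            "replicas": replicas,
            "selector": {"matchLabels": labels},
            "template": {
                "metadata": {"labels": labels},
                "spec": {
                    "serviceAccountName": service_account,
                    "automountServiceAccountToken": mount_api_token,
                    "containers": [{"name": name, "image": image}],
                },
            },
        },
    }


# Hypothetical names and images, for illustration only.
control_plane = deployment(
    name="gateway-control-plane",
    image="example.com/control-plane:dev",
    replicas=1,
    service_account="gateway-control-plane",  # bound to the RBAC it needs
    mount_api_token=True,                      # talks to the Kubernetes API
)
data_plane = deployment(
    name="gateway-data-plane",
    image="example.com/nginx-data-plane:dev",
    replicas=3,                                # scales on its own axis
    service_account="gateway-data-plane",      # separate identity, no API RBAC
    mount_api_token=False,                     # no API token in the Pod
)

if __name__ == "__main__":
    for manifest in (control_plane, data_plane):
        print(json.dumps(manifest, indent=2))
```

With separate ServiceAccounts, RBAC can be granted to the control plane alone, and each Deployment's `replicas` can change without touching the other.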

A/C:

Create a design document outlining the new architecture and deployment pattern. It should cover communication channels between the control and data planes, authentication and authorization, interfaces, and data plane agent options. Both planes should eventually scale, but only data plane scaling is required for the first iteration.

### Stories
- [x] #344
- [x] #376 
- [x] #377 
- [ ] #378 
- [ ] #379
- [ ] #375   
- [ ] #380 
- [ ] #381 
- [ ] #374 
- [ ] #382 
- [ ] #459

Out of scope:

Aha! Link: https://nginx.aha.io/features/NKG-72

mpstefan commented 1 year ago

I think before we pick this back up, we may want to do two things:

  1. Define the value to the user of this new architecture
  2. Define the architecture we want to achieve with this change, or over time

mpstefan commented 9 months ago

Mistakenly marked as complete. Moving this functionality to a new epic.