Borrows from semaphore-service-mirror and wiresteward.
An experimental WireGuard peer manager meant to run as a Kubernetes DaemonSet that peers local and remote cluster nodes in a full mesh.
Each node gets one WireGuard interface per remote cluster, with all of that cluster's nodes configured as peers on it. Routing is done via the WireGuard interface and a single route created on the host for the whole remote Pod subnet. The controller does not clean up network configuration on teardown, so restarts can go unnoticed by running workloads; devices are re-synced on startup.
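For example, for a remote cluster named c2 with pod subnet 10.4.0.0/16 (as in the example configuration further down), the end state on each local node is roughly equivalent to the following; device name and subnet here are illustrative:

# One WireGuard device per remote cluster, named after it, carrying all of
# that cluster's nodes as peers.
ip link add wireguard.c2 type wireguard

# A single host route for the whole remote pod subnet via that device.
ip route add 10.4.0.0/16 dev wireguard.c2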
Usage of ./semaphore-wireguard:
  -clusters-config string
        Path to the clusters' json config file
  -listen-address string
        Listen address to serve health and metrics (default ":7773")
  -log-level string
        Log level (default "info")
  -node-name string
        (Required) The node on which semaphore-wireguard is running
  -wg-key-path string
        Path to store and look for wg private key (default "/var/lib/semaphore-wireguard")
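A typical invocation might look like the following sketch; the config path is illustrative, and in a DaemonSet the node name is commonly populated from the Pod's spec.nodeName via the downward API:

./semaphore-wireguard \
  -clusters-config=/etc/semaphore-wireguard/config.json \
  -node-name="${NODE_NAME}" \
  -log-level=info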
Semaphore-wireguard is developed against Kubernetes clusters which use the Calico CNI and thus relies on a few Calico concepts in order to function. Moreover, the DaemonSet pods read the PodCIDR field from Kubernetes Node resources in order to determine the allowed IPs for each WireGuard interface they create. As a result, this is tested to work using the host-local IPAM with Calico:
"ipam": {
"type": "host-local",
"subnet": "usePodCidr"
},
which will make sure that pods scheduled on a node are allocated IP addresses from the range stored in the node's PodCIDR field.
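For context, here is a minimal sketch of where that ipam block sits inside a Calico CNI config; all surrounding fields are illustrative and your conflist will differ:

{
  "name": "k8s-pod-network",
  "cniVersion": "0.3.1",
  "plugins": [
    {
      "type": "calico",
      "ipam": {
        "type": "host-local",
        "subnet": "usePodCidr"
      },
      "policy": {
        "type": "k8s"
      },
      "kubernetes": {
        "kubeconfig": "/etc/cni/net.d/calico-kubeconfig"
      }
    }
  ]
}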
A JSON config is expected to define all the needed information regarding the local and the remote clusters that semaphore-wireguard operates on.

local
    name
        The name of the local cluster. This will be used by the controller to
        look for WireGuard configuration in remote nodes' annotations, with
        the following pattern: <name>.wireguard.semaphore.uw.io.

remotes
    A list of remote clusters, each of which may define the following:
    name
        The name of the remote cluster. This will be used when creating the
        local WireGuard interfaces (wireguard.<name>) and the annotations that
        expose the needed configuration to remote clusters' controllers.
    remoteAPIURL
        The kube apiserver URL for the remote cluster.
    remoteCAURL
        Endpoint to fetch the remote cluster's CA.
    remoteSATokenPath
        Path to the ServiceAccount token that allows watching the remote
        cluster's nodes.
    kubeConfigPath
        Path to a kube config file. This is an alternative to the three
        options above (see the sketch after this list).
    podSubnet
        The cluster's Pod subnet. It will be used to configure a static route
        to the subnet via the created wg interface. Pod subnets should be
        unique across the configuration, so that routes to different clusters'
        pods do not overlap. As a result, clusters which use the same subnet
        for pods cannot be paired.
    wgDeviceMTU
        MTU for the created WireGuard interface.
    wgListenPort
        WireGuard listen port; remote cluster nodes should be able to reach
        this port.
    resyncPeriod
        Kubernetes watcher resync period. It should yield update events for
        everything that is stored in the cache. The default value of 0
        disables it.
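For reference, a remotes entry that uses kubeConfigPath instead of the three remote* options might look like the following sketch (the kube config path here is illustrative):

{
  "name": "c2",
  "kubeConfigPath": "/etc/semaphore-wireguard/kubeconfig-c2",
  "podSubnet": "10.4.0.0/16",
  "wgDeviceMTU": 1380,
  "wgListenPort": 51821
}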
Cluster names should be unique and consistent across the configuration of different deployments that live in different clusters. For example, if we pick c1 as our local cluster name for a particular deployment, the same name should be set as the remote cluster name in other deployments that will try to pair with the local cluster.

We use the configured cluster names to construct the respective WireGuard interfaces on the host, prefixing them with wireguard. (e.g. wireguard.c2 for a remote cluster named c2). Because there is a limit on the length of interface names, our prefix allows the user to define cluster names of up to 6 characters; otherwise a validation error will be raised.
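With the example configuration below, a node in c1 ends up with a wireguard.c2 device; commands like the following can be used on a node to inspect the resulting device and route:

# Inspect the WireGuard device created for the remote cluster c2
ip link show wireguard.c2
wg show wireguard.c2

# Inspect the host route for c2's pod subnet
ip route show 10.4.0.0/16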
Cluster1 (c1) configuration:
{
  "local": {
    "name": "c1"
  },
  "remotes": [
    {
      "name": "c2",
      "remoteAPIURL": "https://lb.c2.k8s.uw.systems",
      "remoteCAURL": "https://kube-ca-cert.c2.uw.systems",
      "remoteSATokenPath": "/etc/semaphore-wireguard/tokens/c2/token",
      "podSubnet": "10.4.0.0/16",
      "wgDeviceMTU": 1380,
      "wgListenPort": 51821
    }
  ]
}
Cluster2 (c2) configuration:
{
  "local": {
    "name": "c2"
  },
  "remotes": [
    {
      "name": "c1",
      "remoteAPIURL": "https://lb.c1.k8s.uw.systems",
      "remoteCAURL": "https://kube-ca-cert.c1.uw.systems",
      "remoteSATokenPath": "/etc/semaphore-wireguard/tokens/c1/token",
      "podSubnet": "10.2.0.0/16",
      "wgDeviceMTU": 1380,
      "wgListenPort": 51821
    }
  ]
}
In order for Calico to accept traffic to/from remote pod subnets, we need to define Calico IPPools for all the subnets we want to allow. In the above example, each cluster will need a local definition of the remote cluster's IP address pool. So in c1 we need to define the following:
apiVersion: projectcalico.org/v3
kind: IPPool
metadata:
  name: c2-pods
spec:
  cidr: 10.4.0.0/16
  ipipMode: CrossSubnet
  disabled: true
and the relevant definition in c2:
apiVersion: projectcalico.org/v3
kind: IPPool
metadata:
  name: c1-pods
spec:
  cidr: 10.2.0.0/16
  ipipMode: CrossSubnet
  disabled: true
Beware that disabled: true is necessary here, so that Calico avoids assigning local pods IP addresses from the pool defined for remote workloads.
Pod-to-pod communication is still subject to the network policies deployed on each cluster. One can start with wide policies that allow the full remote subnet to reach local pods, although we recommend trying our policy operator.
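As a starting point, a wide policy in c1 that allows ingress from the whole c2 pod subnet might look like the following sketch (namespace and pod selector are illustrative):

apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-from-c2-pods
  namespace: my-namespace
spec:
  podSelector: {}
  policyTypes:
    - Ingress
  ingress:
    - from:
        - ipBlock:
            cidr: 10.4.0.0/16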
See the example kube manifests to deploy under the example dir.