Closed pontal4 closed 3 weeks ago
Hi, you mixed different configs for different CCMs
start from Talos CCM fist, since it responsible for initializing process.
TalosCCM with node-ipam-controller
requires useDaemonSet
CCM IPAM makes sense only with IPv6 or dual stack environments. https://github.com/siderolabs/talos-cloud-controller-manager/blob/main/docs/controllers.md#node-ipam
# Helm values
useDaemonSet: true
logVerbosityLevel: 5
enabledControllers:
- cloud-node
- node-ipam-controller
extraArgs:
- --allocate-node-cidrs
- --cidr-allocator-type=CloudAllocator
- --node-cidr-mask-size-ipv4=24
- --node-cidr-mask-size-ipv6=80
Talos CCM without Node IPAM
# Helm values
useDaemonSet: true
logVerbosityLevel: 5
enabledControllers:
- cloud-node
# Talos machine config
cluster:
allowSchedulingOnControlPlanes: true
controllerManager:
extraArgs:
# Disable node IPAM controller, if you use node-ipam-controller on CCM side
controllers: "*,tokencleaner,-node-ipam-controller"
machine:
kubelet:
extraArgs:
cloud-provider: external
features:
kubernetesTalosAPIAccess:
enabled: true
allowedRoles:
- os:reader
allowedKubernetesNamespaces:
- kube-system
Proxmox CCM deployment for talos - https://github.com/sergelogvinov/proxmox-cloud-controller-manager/blob/main/docs/deploy/cloud-controller-manager-talos.yml values: https://github.com/sergelogvinov/proxmox-cloud-controller-manager/blob/main/charts/proxmox-cloud-controller-manager/values.talos.yaml
Yeah, example here is not so clear https://github.com/siderolabs/talos-cloud-controller-manager/blob/main/docs/install.md
Hello, Thanks for your reply and your project, For context, I mixed a lot of things from a lot of documentation, thoses tools are hard to use. I would like to use Talos CCM for (i don't know), and the Proxmox CCM to tag correctly my nodes, and link correctly my PVC (i'm using proxmox-csi-driver). I check one of your repo, and you do something like this: https://github.com/sergelogvinov/terraform-talos/blob/main/proxmox/deployments/talos-ccm.yaml
I do the changes you say, but it's the same issue. Do I have to enable rotation of server certs?
Yeah, if you use kubernetes metrics better to enable certificate rotation.
If you do not use Talos CCM features, you can use only Proxmox CCM with
enabledControllers:
- cloud-node
- cloud-node-lifecycle
and add only this patch to talos machine config
machine:
kubelet:
extraArgs:
cloud-provider: external
Unfortunately https://github.com/sergelogvinov/terraform-talos sometimes has a bug, since it is my dev/research environment
If I don't use the Talos CCM, my nodes are not getting labels correctly. Do you have a sample setup who works on a clustered Proxmox with multiples nodes, with CCM ? I do some tests, and sometimes, I got the error message for only one control plane, while the two others are working correctly. This feature seems hard to implement, but it's really necessary for a cluster
I just find an "alternative", I set the talos ccm replica count to one :
replicaCount: 1
enabledControllers:
- cloud-node
- node-csr-approval
Pod had to restart multiple times before going "Running", but I guess it will works. Thanks you !
I also have some issues with proxmox-csi plugin "GRPC error: failed to get node worker-01: Unauthorized" So, my talos configuration is probably not right
It’s hard to tell from the logs 'GRPC error: failed to get node worker-01: Unauthorized.' But it looks like the Kubernetes service account might not have the right permissions to make the API call. Please check your deployment settings, as there might be a mistake in the configuration
Please try recreating the cluster.
Hello, After migrating to Talos 1.8.1, everything seems to works perfectly, some pods need to be restarted 2 or 3 times but it's not really a big issue now. Thanks for your time!
Bug Report
Description
Hello ! I got this issue using CCM with Talos on a Proxmox cluster. I have 3 control-plane, and 5 workers.
The proxmox cluster have two nodes : pve01 -> cp-01, cp-03, workers-... pve02 -> cp-02, workers-...
It is working perfectly on cp-01, but not on cp-02 and cp-03.
Logs
Environment
Image tag: ghcr.io/sergelogvinov/talos-cloud-controller-manager@sha256:fae728f874882406ad15686bdd2c556336b3406d7896f75fe223582f3d75f20f (nodeipam)
Kubernetes version: 1.31.1
OS version [
cat /etc/os-release
] : Talos v1.8Deployment (using Ansible) :
Values :
The secret is created on the control-plane inline-manifests :