openshift / lvm-operator

The LVM Operator deploys and manages LVM storage on OpenShift clusters
Apache License 2.0
42 stars 37 forks source link

No news disks are added to the LVMCluster/LVMvolumegroupnodestatuses #105

Closed jgato closed 2 years ago

jgato commented 2 years ago

I had one SNO with an LVMCluster created to manage the VG vg1 (/dev/nvme0n1, /dev/nvme1n1, /dev/sda). I wanted to test the addition of new disks. So, I rebooted and created some more disks from the raid I have in the server.

After creating some more disks the SNO is rebooted and:

 * This new disks are not recognized by the LVMCluster/LVMvolumegroupnodestatuses:

$ oc get lvmvolumegroupnodestatuses -o yaml apiVersion: v1 items:



I have created a second LVMCluster because I thought this would collect the new disks. Now I understand it is only one LVMCluster supported, would this have interfere?  
nbalacha commented 2 years ago

I have created a second LVMCluster because I thought this would collect the new disks. Now I understand it is only one LVMCluster supported, would this have interfere?

It should not. Do you see any errors in the controller-manager or vgmanager logs files?

jgato commented 2 years ago

From the controller-manager:

{
  "level": "error",
  "ts": 1644229480.7411144,
  "logger": "controller.lvmcluster.lvmcluster-controller",
  "msg": "failed to create or update vgManager daemonset",
  "reconciler group": "lvm.topolvm.io",
  "reconciler kind": "LVMCluster",
  "name": "vg-manager",
  "namespace": "lvm-operator-system",
  "error": "failed to update controller reference on vgManager daemonset \"vg-manager\". Object lvm-operator-system/vg-manager is already owned by another LVMCluster controller lvmcluster-sample",
  "stacktrace": "github.com/red-hat-storage/lvm-operator/controllers.(*LVMClusterReconciler).Reconcile\n\t/workspace/controllers/lvmcluster_controller.go:104\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Reconcile\n\t/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.10.2/pkg/internal/controller/controller.go:114\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).reconcileHandler\n\t/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.10.2/pkg/internal/controller/controller.go:311\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem\n\t/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.10.2/pkg/internal/controller/controller.go:266\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2\n\t/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.10.2/pkg/internal/controller/controller.go:227"
}
{
  "level": "error",
  "ts": 1644229480.7504883,
  "logger": "controller.lvmcluster",
  "msg": "Reconciler error",
  "reconciler group": "lvm.topolvm.io",
  "reconciler kind": "LVMCluster",
  "name": "lvmcluster-sample-2",
  "namespace": "lvm-operator-system",
  "error": "failed reconciling: vg-manager failed to update controller reference on vgManager daemonset \"vg-manager\". Object lvm-operator-system/vg-manager is already owned by another LVMCluster controller lvmcluster-sample",
  "stacktrace": "sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2\n\t/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.10.2/pkg/internal/controller/controller.go:227"
}

So it could be the second LVMCCluster creation could be causing problems. Do you want me to delete it?

No errors in the vgmanager.

nbalacha commented 2 years ago

So it could be the second LVMCCluster creation could be causing problems. Do you want me to delete it?

Yes, please delete the second LVMCluster.

jgato commented 2 years ago

Ok, I have cleaned up everything. The second LVMCluster was causing problems and it is something not supported yet. I have repeated the process: Phase 1)

 * But still not recognizable by the LVMCluster

Device Class Statuses: Name: vg1 Node Status: Devices: /dev/nvme0n1 /dev/nvme1n1 /dev/sda /dev/sdc /dev/sdd /dev/sdg