CrowdStrike / falcon-operator

https://artifacthub.io/packages/olm/falcon-operator/falcon-operator
Apache License 2.0
49 stars 37 forks source link

Opeshift 4.14 unable to deploy FalconNodeSensor #564

Open siwyroot opened 3 months ago

siwyroot commented 3 months ago

Hello, I'm trying to deploy FalconNodeSensor on OpenShift 4.14, operator installed ok but I cannot create FalconNodeSensor as I get strange error: Not Found. Latest version of operator from OperatorHub

image

image

siwyroot commented 3 months ago

Removed old operator, crds, roles, roles bindings, sa and everything I could find, reinstalled operator, still gives me a finger.

n00bsi commented 3 months ago

@siwyroot

have a similar problem - installation hangs - thanks for your hint

I checked via ComandLine and found some entrys

$ ./oc get ns | grep -i falc
$ 

$ ./oc get sa | grep -i falc
$ 

$ ./oc get crd | grep -i falc
$ 

$ ./oc get roles | grep -i falc
$ 

If there any entrys - delete

./oc delete .....

example:

./oc delete crd falconadmissions.falcon.crowdstrike.com

But now I like to install the Node Sensor:

node_sensor_install_1

node_sensor_install_2

siwyroot commented 3 months ago

@n00bsi to fix this:

  1. Uninstall operator
  2. delete all crds with falcon in name
  3. delete permission related objects oc delete $(oc get clusterrole,clusterrolebinding -l crowdstrike.com/created-by=falcon-operator -o name)
  4. Depending on your OS you have to delete ~/.kube/config if you are using flux/argocd restart pods that apply yamls
  5. Reinstall operator, not you can apply node sensor via cmd (should work via UI) pods will start in different namespace now (falson-system) which will be created by operator
redhatrises commented 3 months ago

Assuming that the resources where cleanup before the operator was uninstalled, there is an issue in OpenShift where the resources are not always cleaned up internally in a timely manner which requires you to wait for a while for the kube api to cleanup delete resources internally in openshift before it will allow the operator to work. Also you must clean up all the resources on the cluster. See https://github.com/CrowdStrike/falcon-operator/tree/main/docs/deployment/openshift#uninstall-the-operator-1