gchq / gaffer-docker

Gaffer Docker images and associated Helm charts for deploying on Kubernetes
https://gchq.github.io/gaffer-docker
Apache License 2.0
30 stars 36 forks source link

Helm chart installation failure and missing JAVA_OPTS parameters #328

Open DavidAnDavis2021 opened 10 months ago

DavidAnDavis2021 commented 10 months ago

After setting up my kind nodes, doing a

cd kubernetes/gaffer-road-traffic
helm install --debug gaffer .

I get the following errors:

client.go:796: [debug] Pod gaffer-accumulo-post-install-cmds running                                                                                                                          
Error: INSTALLATION FAILED: failed post-install: 1 error occurred:                                                                                                                            
        * timed out waiting for the condition                                                                                                                                                 

helm.go:84: [debug] failed post-install: 1 error occurred:                                                                                                                                    
        * timed out waiting for the condition                                                                                                                                                 

INSTALLATION FAILED                                                                                                                                                                           
main.newInstallCmd.func2                                                                                                                                                                      
        helm.sh/helm/v3/cmd/helm/install.go:147                                                                                                                                               
github.com/spf13/cobra.(*Command).execute                                                                                                                                                     
        github.com/spf13/cobra@v1.6.1/command.go:916                                                                                                                                          
github.com/spf13/cobra.(*Command).ExecuteC                                                                                                                                                    
        github.com/spf13/cobra@v1.6.1/command.go:1044                                                                                                                                         
github.com/spf13/cobra.(*Command).Execute                                                                                                                                                     
        github.com/spf13/cobra@v1.6.1/command.go:968                                                                                                                                          
main.main                                                                                                                                                                                     
        helm.sh/helm/v3/cmd/helm/helm.go:83                                                                                                                                                   
runtime.main                                                                                                                                                                                  
        runtime/proc.go:250                                                                                                                                                                   
runtime.goexit                                                                                                                                                                                
        runtime/asm_amd64.s:1598                      

Looking at the status of the pods I see many of them are stuck in a crash loop back off state. Looking at the logs of the gaffer-accumulo-master-0, I see

(base) ubuntu@ip-XXX:~/vendor/gaffer-docker/kubernetes/gaffer-road-traffic$ kubectl logs gaffer-accumulo-masters-0
Fri Sep  8 17:44:32 UTC 2023 - Waiting for all HDFS instances to be ready...                                                           
2023-09-08 17:44:34,226 [util.NativeCodeLoader] WARN : Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
2023-09-08 17:44:35,115 [util.NativeCodeLoader] WARN : Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
hdfs://gaffer-hdfs-namenode-0.gaffer-hdfs-namenodes:8020/accumulo - Safe mode is OFF - writeable           
Initializing Accumulo...
/opt/accumulo/bin/accumulo: line 48: JAVA_OPTS: variable is not set in accumulo-env.sh
/opt/accumulo/bin/accumulo: line 48: JAVA_OPTS: variable is not set in accumulo-env.sh

There may be other errors as well causing other pods to fail as well.