bstick12 / kubo-bosh-lite


flanneld job fails to start using stemcell 3468.5 #4

Open stephendotcarter opened 7 years ago

stephendotcarter commented 7 years ago

I followed the installation steps but flanneld on the master node failed to start:

Task 6 | 08:41:46 | Updating instance master: master/ea72250c-d7e9-4043-a291-aff24fac592d (0) (canary) (00:05:30)
                   L Error: 'master/ea72250c-d7e9-4043-a291-aff24fac592d (0)' is not running after update. Review logs for failed jobs: flanneld

On checking the flanneld logs I found the following:

+ echo '------------ STARTING flanneld_ctl at Tue Dec  5 08:44:09 UTC 2017 --------------'
+ '[' '!' -f /var/vcap/sys/run/flanneld/flanneld.pid ']'
+ local pid
++ head -1 /var/vcap/sys/run/flanneld/flanneld.pid
+ pid=4945
+ pid_is_running 4945
+ declare pid=4945
+ ps -p 4945
+ __log 'Removing stale pidfile'
+ echo 'Removing stale pidfile'
+ rm /var/vcap/sys/run/flanneld/flanneld.pid
+ echo 4956
+ start_flanneld
+ modprobe br_netfilter
modprobe: ERROR: ../libkmod/libkmod.c:556 kmod_search_moddep() could not open moddep file '/lib/modules/4.4.0-101-generic/modules.dep.bin'

The line of code that fails (modprobe br_netfilter, per the trace above) was added in kubo-release v0.8.1.

This seems to be an issue with the BOSH Lite stemcell 3468.5. Using 3468.11 works around the issue.
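The modprobe error says that modules.dep.bin could not be opened for kernel 4.4.0-101-generic. A minimal sketch of how one might check for that file before starting the job is below. The helper name has_moddep and its arguments are hypothetical and are not part of kubo-release or flanneld_ctl.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not from kubo-release): succeed only if
# modules.dep.bin exists for the given kernel release.
#   $1 = kernel release (defaults to the running kernel, via uname -r)
#   $2 = modules root   (defaults to /lib/modules)
has_moddep() {
  local release="${1:-$(uname -r)}"
  local root="${2:-/lib/modules}"
  [ -f "${root}/${release}/modules.dep.bin" ]
}

# Example use on the affected VM (left commented out on purpose):
# has_moddep || echo "no modules.dep.bin for kernel $(uname -r)"
```

If the check fails for the running kernel, modprobe has no dependency index for that kernel, which matches the error in the log above.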

bstick12 commented 6 years ago

The stemcell version that is uploaded must be the same as the stemcell version used by the bosh deployment.

I am looking into a way to get the bosh deployment stemcell version programmatically so that I can use it in the upload-stemcell command.
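One possible approach, sketched here under assumptions: if the stemcell the director was built with is available as a bosh.io-style URL ending in ?v=VERSION, the version can be pulled out of the query string and reused. The helper name stemcell_version_from_url is hypothetical, and where the URL comes from is an assumption, not something this repository is confirmed to provide.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the value of the "v" query parameter from a
# stemcell URL such as ...go_agent?v=3468.11. Returns non-zero if the URL
# has no query string or no v= parameter.
stemcell_version_from_url() {
  local url="$1"
  local query="${url#*\?}"             # drop everything up to the first '?'
  [ "$query" != "$url" ] || return 1   # unchanged means there was no '?'
  local pair
  local IFS='&'                        # split the query string on '&'
  for pair in $query; do
    case "$pair" in
      v=*) printf '%s\n' "${pair#v=}"; return 0 ;;
    esac
  done
  return 1
}

# Example use (assumed URL and stemcell name; left commented out on purpose):
# version="$(stemcell_version_from_url "$DIRECTOR_STEMCELL_URL")"
# bosh upload-stemcell \
#   "https://bosh.io/d/stemcells/bosh-warden-boshlite-ubuntu-trusty-go_agent?v=${version}"
```

Parsing the version from one source and passing it to upload-stemcell keeps the two versions from drifting apart, which is the mismatch described above.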