Closed: andy108369 closed this issue 3 months ago
No restarts or issues since the provider was last started (26 hrs uptime). I'll let it run like this over the weekend and then re-enable IP Leasing.
Awaiting further testing by @andy108369 prior to further investigation
Re-enabled IP Leasing:
provider.yaml
ipoperator: true
Installed the metallb chart and applied the config:
helm upgrade --install metallb metallb/metallb -n metallb-system --version 0.14.3
kubectl apply -f metallb-config.yaml
Installed the akash-ip-operator chart:
helm upgrade --install akash-ip-operator akash/akash-ip-operator -n akash-services --set provider_address=akash15tl6v6gd0nte0syyxnv57zmmspgju4c3xfmdhk
I can't see this issue any longer with provider 0.5.11; closing.
The Hurricane provider sporadically stops responding over 8443/status, 8444 (either immediately after start or after some time) since it was upgraded from 0.4.8 to 0.5.4.
Logs
After the provider pod restarted, the provider did not respond over 8443/status, 8444 right away: hurricane-0.5.4-NOT-responding-over-8443-8444-right-away.log
The provider ran for some time and then stopped responding over 8443/status, 8444: hurricane-0.5.4-stopped-responding-over-8443-8444-right-after-inventory-MISSING-IP-operator-false-no-ip-operator-chart.log
Workarounds
I've implemented an automatic provider pod restart that triggers when the livenessProbe cannot get data from 8443/status, etc.
Will keep monitoring the akash-provider pod restart count.
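For anyone wanting to reproduce the workaround, here is a rough sketch of the two pieces described above: a check that a livenessProbe `exec` command could call, and a helper that reads the pod restart count. This is not the exact probe deployed on Hurricane. The function names, the 5-second timeout, the use of `-k` (assuming a self-signed certificate), and the pod name in the usage comments are all assumptions for illustration. Only 8443/status is covered; a check for 8444 is left out because the thread does not say how that port was probed.

```shell
#!/bin/sh
# Hypothetical helpers; names, timeout and pod name are assumptions.

# check_status URL [TIMEOUT_SECONDS]
# Returns 0 if the endpoint answers with a successful HTTP status within
# the timeout, non-zero otherwise. A non-zero exit from an exec
# livenessProbe makes the kubelet restart the container.
check_status() {
  url="$1"
  timeout="${2:-5}"
  # -k: skip TLS verification (assumes a self-signed certificate)
  # -s: silent, -f: treat HTTP errors (>= 400) as failure
  curl -ksf --max-time "$timeout" -o /dev/null "$url"
}

# restart_count NAMESPACE POD
# Prints the restart count of the first container in the pod.
restart_count() {
  ns="$1"
  pod="$2"
  kubectl -n "$ns" get pod "$pod" \
    -o jsonpath='{.status.containerStatuses[0].restartCount}'
}

# Example usage (pod name is an assumption):
#   check_status https://127.0.0.1:8443/status 5 || exit 1
#   restart_count akash-services akash-provider-0
```

A rising restart count over time would indicate the probe is still catching the hang. A count that stays flat after an upgrade suggests the underlying issue is gone.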
Additional notes
I have not observed this issue on any provider other than Hurricane since we upgraded the providers from 0.4.8 to 0.5.4.