You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hurricane provider stops responding over 8443/status, 8444 sporadically (either immediately after start or after some time) since upgrading it from 0.4.8 to 0.5.4
NOTE: AKASH_IP_OPERATOR=false, akash ip operator helm chart not present (normally IP Leasing would be enabled, but I've disabled it as I've initially thought it was causing the problem)
No restarts nor issues since the last time provider was started (26hrs uptime).
I'll let it run like this for over the weekend and will enable the IP Leasing back again.
Hurricane provider stops responding over 8443/status, 8444 sporadically (either immediately after start or after some time) since upgrading it from
0.4.8
to0.5.4
Logs
after provider pod restarted - provider simply did not respond over 8443/status, 8444 right away:
hurricane-0.5.4-NOT-responding-over-8443-8444-right-away.log
provider has been running for some time and then stopped responding over 8443/status, 8444:
hurricane-0.5.4-stopped-responding-over-8443-8444-right-after-inventory-MISSING-IP-operator-false-no-ip-operator-chart.log
Workarounds
I've implemented automatic provider pod restart if livenessProbe finds it cannot get the data from 8443/status, etc
Will keep monitoring the akash-provider pod restart count.
Additional notes
I have not observed this issue on any other provider except for the Hurricane provider since we've upgraded providers from 0.4.8 to 0.5.4.
The text was updated successfully, but these errors were encountered: