Rolling a cluster from Kubernetes 1.30 to 1.31 gets stuck in a validation loop when new nodes are added to the cluster via CAS/Karpenter after kops update cluster completes #16907
Some options discussed in office hours:
We'll likely start with the first option and see how the ergonomics of the second option feel, given that it depends on the first option. In either case we'll add upgrade instructions to the release notes for this new behavior.
/kind blocks-next
Another option, and possibly a more correct one, would be to enforce the version skew policy (https://kubernetes.io/releases/version-skew-policy/#kubelet). As such, the userdata for instance groups shouldn't be updated until the control plane has already been rolled out to the newer version, ensuring that we never have nodes coming up with a kubelet version that is more recent than any control-plane node. E.g., kops update cluster would:
In this situation:
Optionally, similar to the suggestion above, a flag for going through the whole procedure, like --sync or --wait, could:
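To make the invariant in that proposal concrete, here is a rough read-only check (a sketch, not kOps code; it assumes kubectl access and the standard node-role.kubernetes.io/control-plane label) of the condition that must hold throughout the roll:

# Oldest kubelet version among control-plane nodes.
kubectl get nodes -l node-role.kubernetes.io/control-plane \
  -o jsonpath='{range .items[*]}{.status.nodeInfo.kubeletVersion}{"\n"}{end}' | sort -V | head -n1

# Newest kubelet version among worker nodes.
kubectl get nodes -l '!node-role.kubernetes.io/control-plane' \
  -o jsonpath='{range .items[*]}{.status.nodeInfo.kubeletVersion}{"\n"}{end}' | sort -V | tail -n1

# The second value must never be newer than the first; the proposal above keeps
# node userdata unchanged until that holds for the target version.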
/kind bug
1. What kops version are you running? The command kops version will display this information.
1.31.0-alpha.1
2. What Kubernetes version are you running? kubectl version will print the version if a cluster is running or provide the Kubernetes version specified as a kops flag.
Upgrading from 1.30.5 to 1.31.1.
3. What cloud provider are you using?
AWS
4. What commands did you run? What is the simplest way to reproduce this issue?
Update the cluster kubernetesVersion and then run (a reproduction sketch follows below):
kops update cluster
kops rolling-update cluster
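A minimal reproduction sketch of that sequence, assuming an existing kOps cluster with Karpenter-managed instance groups and the versions from this report:

# Set spec.kubernetesVersion: 1.31.1 in the cluster spec.
kops edit cluster

# Push the new configuration; this also updates the node userdata that
# Karpenter uses for newly launched instances.
kops update cluster --yes

# Start replacing nodes. Any 1.31 worker that Karpenter launches while the
# control plane is still on 1.30 fails validation, and the roll loops until it
# times out.
kops rolling-update cluster --yes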
5. What happened after the commands executed?
The rolling-update got stuck in a validation loop and eventually timed out, because pods on the new worker nodes created by Karpenter after kops update cluster failed to start, as described in kubernetes/kubernetes#127316.
6. What did you expect to happen?
It would have been great if the rolling update completed without errors.
7. Please provide your cluster manifest. Execute kops get --name my.example.com -o yaml to display your cluster manifest. You may want to remove your cluster name and other sensitive information.
The only relevant part here is having Karpenter enabled and then upgrading the Kubernetes version to 1.31.1.
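For illustration, a minimal excerpt of the relevant manifest fields (an assumed sketch, not the reporter's actual manifest; field names follow the kOps Cluster and InstanceGroup APIs):

apiVersion: kops.k8s.io/v1alpha2
kind: Cluster
spec:
  kubernetesVersion: 1.31.1
  karpenter:
    enabled: true
---
apiVersion: kops.k8s.io/v1alpha2
kind: InstanceGroup
spec:
  manager: Karpenter
  role: Node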
8. Please run the commands with most verbose logging by adding the -v 10 flag. Paste the logs into this report, or in a gist and provide the gist link here.
The rolling-update validation loop outputs messages like this over and over:
Upon describing one of those pods:
9. Anything else we need to know?
It should be possible to work around this issue by pausing autoscaling before kops update cluster until after kops rolling-update cluster has replaced all of the control-plane nodes, or with judicious use of kops rolling-update cluster --cloudonly.
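One way the pausing workaround could look in practice (a sketch under assumptions, not a verified procedure: the Karpenter deployment name/namespace, the replica count, and the role value passed to --instance-group-roles may differ per cluster and kOps release):

# Stop Karpenter from launching nodes with the new (1.31) userdata.
kubectl -n kube-system scale deployment karpenter --replicas=0

# Apply the new kubernetesVersion to the cloud resources.
kops update cluster --yes

# Replace only the control-plane instance groups first.
kops rolling-update cluster --instance-group-roles=control-plane --yes

# Once every control-plane node runs 1.31, let Karpenter resume
# (restore the original replica count).
kubectl -n kube-system scale deployment karpenter --replicas=1

# Finish rolling the remaining worker instance groups.
kops rolling-update cluster --yes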