feat: Add options to configure consolidation timeouts #1031
Conversation
[APPROVALNOTIFIER] This PR is NOT APPROVED. This pull-request has been approved by: domgoodwin. The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing /approve in a comment.
Welcome @domgoodwin!
Hi @domgoodwin. Thanks for your PR. I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test. Once the patch is verified, the new status will be reflected by the ok-to-test label. I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
This feature starts to look similar to our batch duration parameters, which we were a little iffy on whether we should have surfaced in the first place. Do you also set the batch duration parameters to custom values overriding the defaults? The other question I have: what would you set these values to if you could override them as proposed here? And what's the size of your cluster that doesn't work well with the current timeout values?
This PR has been inactive for 14 days. StaleBot will close this stale PR after 14 more days of inactivity. |
I haven't changed the batch duration parameters currently, as I figured those were more around scale-out, which seems to work OK. We do get a bit of rubber-banding where we add more nodes than we need and then consolidate down, but honestly the speed trade-off from not having to run any headroom pods seems worth it. We are actually running this code, as our cluster just wasn't scaling in at all with the current timeouts. We've set them to 10m, although based on metrics 5m would be fine. We initially just threw more CPU at Karpenter hoping it would help, but it seemingly only ever uses 2 CPU cores.
I'm hesitant to make this configurable, as changing these knobs can have unknown impacts on the performance of consolidation. Can you share what cluster sizes you're running at? Are you using any complex scheduling constraints like anti-affinity? If anything, I can see making this scale with the size of the cluster in the long run, but I'd be concerned with giving users free rein over this.
The cluster runs anywhere from 25k pods at peak down to 15k overnight, across 250-500 nodes. We used to have anti-affinities but removed them across the board and saw a significant scheduling performance increase.
How did you come to these numbers? Did you test them out yourself? I can understand that these timeouts shouldn't be one-size-fits-all, and it sounds like the numbers you're proposing would work better for your situation. Especially when adding an API surface (even behind a feature gate), this is the type of feature that would still require an RFC. Do you mind writing one up? Feel free to reach out to the maintainers on the Kubernetes Slack to figure out how to write one, or check out the existing RFCs in the repo.
This PR has been inactive for 14 days. StaleBot will close this stale PR after 14 more days of inactivity. |
Issue: #903
Description
Adds option values to configure the timeouts for multi-node and single-node consolidation. Depending on the size of a cluster and its various affinities/topology spreads, consolidation can take longer than the current hard-coded values allow. Making them configurable means large, complex clusters can opt into longer timeouts if desired.
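
For illustration only, here is a minimal Go sketch of how such timeouts might be surfaced as operator settings. The flag names and default values below are hypothetical stand-ins for the constants currently hard-coded in the consolidation controllers, not the actual option names added by this PR:

```go
package main

import (
	"flag"
	"fmt"
	"time"
)

// ConsolidationTimeouts holds the proposed tunables. When the flags are not
// set, the fields fall back to defaults standing in for the existing
// hard-coded values.
type ConsolidationTimeouts struct {
	MultiNode  time.Duration
	SingleNode time.Duration
}

// parseConsolidationTimeouts parses hypothetical duration flags from args.
func parseConsolidationTimeouts(args []string) (ConsolidationTimeouts, error) {
	t := ConsolidationTimeouts{}
	fs := flag.NewFlagSet("karpenter", flag.ContinueOnError)
	// Flag names and defaults are illustrative, not the PR's real API surface.
	fs.DurationVar(&t.MultiNode, "multi-node-consolidation-timeout", 1*time.Minute,
		"timeout for a multi-node consolidation attempt")
	fs.DurationVar(&t.SingleNode, "single-node-consolidation-timeout", 3*time.Minute,
		"timeout for a single-node consolidation attempt")
	if err := fs.Parse(args); err != nil {
		return ConsolidationTimeouts{}, err
	}
	return t, nil
}

func main() {
	// Example: a large cluster raising both timeouts to 10 minutes,
	// as described in the discussion above.
	t, err := parseConsolidationTimeouts([]string{
		"--multi-node-consolidation-timeout=10m",
		"--single-node-consolidation-timeout=10m",
	})
	if err != nil {
		panic(err)
	}
	fmt.Printf("multi-node: %s, single-node: %s\n", t.MultiNode, t.SingleNode)
}
```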
How was this change tested?
I set this version as the Karpenter version in the aws-provider repo, built an image, and deployed it on a test cluster. With the value both set and unset (defaulting to the existing values), everything worked as expected.
i.e.
The unit test changes also cover setting the value to a non-default and verifying that the run times out after that duration.
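
As a sketch of the behaviour such a test would exercise (this is not the PR's actual code or tests, and the `consolidate` function and candidate count are purely illustrative), a configurable timeout bounding a consolidation pass via `context.WithTimeout` looks roughly like this:

```go
package main

import (
	"context"
	"errors"
	"fmt"
	"time"
)

// consolidate stands in for a single consolidation pass; it checks the
// context between candidate evaluations and stops once the deadline hits.
func consolidate(ctx context.Context, candidates int) error {
	for i := 0; i < candidates; i++ {
		select {
		case <-ctx.Done():
			return ctx.Err() // timed out: give up on the remaining candidates
		default:
			time.Sleep(10 * time.Millisecond) // simulate evaluating one candidate
		}
	}
	return nil
}

func main() {
	// A non-default timeout, analogous to overriding the hard-coded value.
	timeout := 50 * time.Millisecond
	ctx, cancel := context.WithTimeout(context.Background(), timeout)
	defer cancel()

	if err := consolidate(ctx, 100); errors.Is(err, context.DeadlineExceeded) {
		fmt.Println("consolidation pass timed out after", timeout)
	}
}
```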