koord-manager: enhance batch resource configuration and updating #1703

saintube · 2023-10-09T13:35:11Z

Ⅰ. Describe what this PR does

koord-manager: enhance batch resource configuration and updating:

Refactor the NRT updating in the BatchResource plugin. Add the PreUpdate stage for checking and updating additional objects alongside the framework's Node updating.
Support the cpuCalculatePolicy in the ColocationStrategy. Enable calculating batch-cpu according to high-priority pods' maximal of requests and usages. The default policy is usage, which allows the low-priority pods to reclaim the resources requested but unused by the high-priority pods. The new policy maxUsageAndRequest is helpful when the cpu resources are not strongly expected to be overcommitted between different priority bands, where neither used resources nor requested resources can be allocatable to the low-priority pods.
Support colocation strategy based on the node metadata. Add an annotation protocol for node-level configuration. Add two label protocols for label-preferred use cases.

Ⅱ. Does this pull request fix one issue?

Ⅲ. Describe how to verify it

Ⅳ. Special notes for reviews

V. Checklist

I have written necessary docs and comments
I have added necessary unit tests and integration tests
All checks passed in make test

codecov · 2023-10-09T13:41:46Z

Codecov Report

Attention: 37 lines in your changes are missing coverage. Please review.

Comparison is base (8c19de1) 65.91% compared to head (6e4c7e8) 65.93%.
Report is 4 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1703      +/-   ##
==========================================
+ Coverage   65.91%   65.93%   +0.01%     
==========================================
  Files         385      385              
  Lines       41639    41766     +127     
==========================================
+ Hits        27447    27539      +92     
- Misses      12155    12186      +31     
- Partials     2037     2041       +4

Flag	Coverage Δ
unittests	`65.93% <81.77%> (+0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files	Coverage Δ
...o-controller/config/colocation_cm_event_handler.go	`75.86% <100.00%> (ø)`
...er/noderesource/plugins/cpunormalization/plugin.go	`86.98% <100.00%> (ø)`
...troller/noderesource/plugins/midresource/plugin.go	`81.30% <100.00%> (ø)`
pkg/slo-controller/noderesource/plugins_profile.go	`100.00% <100.00%> (ø)`
...slo-controller/noderesource/resource_calculator.go	`74.83% <100.00%> (+0.51%)`	⬆️
...oller/noderesource/plugins/batchresource/plugin.go	`76.07% <94.44%> (+1.56%)`	⬆️
pkg/util/sloconfig/colocation_config.go	`93.70% <95.08%> (+0.94%)`	⬆️
...troller/noderesource/plugins/batchresource/util.go	`85.30% <81.81%> (-1.58%)`	⬇️
...ntroller/noderesource/framework/extender_plugin.go	`43.88% <40.47%> (-0.19%)`	⬇️

... and 5 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

pkg/slo-controller/noderesource/framework/extender_plugin.go

apis/extension/node_colocation.go

hormes · 2023-10-14T02:05:44Z

Enable calculating batch-cpu according to high-priority pods' requests instead of usages.

Explain when you need to use request instead of usage?

apis/extension/node_colocation.go

saintube · 2023-10-16T05:50:19Z

Enable calculating batch-cpu according to high-priority pods' requests instead of usages.

Explain when you need to use request instead of usage?

Code comments are added for different calculating policies.

zwzhang0107 · 2023-10-17T03:32:14Z

/lgtm

zwzhang0107 · 2023-10-17T03:33:17Z

/approve

Signed-off-by: saintube <[email protected]>

hormes · 2023-10-19T03:50:11Z

/lgtm
/approve

koordinator-bot · 2023-10-19T03:50:18Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: hormes, zwzhang0107

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [hormes]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

koordinator-bot bot added the do-not-merge/work-in-progress label Oct 9, 2023

koordinator-bot bot requested review from FillZpp and zwzhang0107 October 9, 2023 13:35

koordinator-bot bot added the size/XXL label Oct 9, 2023

saintube force-pushed the koord-manager-enhance-batchresource-for-node-config branch from df39afe to 8726ac4 Compare October 10, 2023 02:10

saintube changed the title ~~[WIP] koord-manager: enhance batch resource configuration and updating~~ koord-manager: enhance batch resource configuration and updating Oct 10, 2023

koordinator-bot bot removed the do-not-merge/work-in-progress label Oct 10, 2023

saintube requested review from eahydra, hormes and jasonliu747 October 10, 2023 05:15

zwzhang0107 reviewed Oct 11, 2023

View reviewed changes

eahydra reviewed Oct 12, 2023

View reviewed changes

apis/extension/node_colocation.go Outdated Show resolved Hide resolved

saintube force-pushed the koord-manager-enhance-batchresource-for-node-config branch from 8726ac4 to 235bc6b Compare October 12, 2023 12:25

hormes reviewed Oct 14, 2023

View reviewed changes

apis/extension/node_colocation.go Show resolved Hide resolved

saintube force-pushed the koord-manager-enhance-batchresource-for-node-config branch from 235bc6b to 24f36f5 Compare October 16, 2023 05:49

koordinator-bot bot assigned zwzhang0107 Oct 17, 2023

koordinator-bot bot added the lgtm label Oct 17, 2023

koord-manager: enhance batch resource configuration and updating

6e4c7e8

Signed-off-by: saintube <[email protected]>

saintube force-pushed the koord-manager-enhance-batchresource-for-node-config branch from 24f36f5 to 6e4c7e8 Compare October 17, 2023 13:41

koordinator-bot bot removed the lgtm label Oct 17, 2023

koordinator-bot bot assigned hormes Oct 19, 2023

koordinator-bot bot added the lgtm label Oct 19, 2023

koordinator-bot bot added the approved label Oct 19, 2023

koordinator-bot bot merged commit 543358d into koordinator-sh:main Oct 19, 2023
21 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

koord-manager: enhance batch resource configuration and updating #1703

koord-manager: enhance batch resource configuration and updating #1703

saintube commented Oct 9, 2023 •

edited

Loading

codecov bot commented Oct 9, 2023 •

edited

Loading

hormes commented Oct 14, 2023

saintube commented Oct 16, 2023 •

edited

Loading

zwzhang0107 commented Oct 17, 2023

zwzhang0107 commented Oct 17, 2023

hormes commented Oct 19, 2023

koordinator-bot bot commented Oct 19, 2023

koord-manager: enhance batch resource configuration and updating #1703

koord-manager: enhance batch resource configuration and updating #1703

Conversation

saintube commented Oct 9, 2023 • edited Loading

Ⅰ. Describe what this PR does

Ⅱ. Does this pull request fix one issue?

Ⅲ. Describe how to verify it

Ⅳ. Special notes for reviews

V. Checklist

codecov bot commented Oct 9, 2023 • edited Loading

Codecov Report

hormes commented Oct 14, 2023

saintube commented Oct 16, 2023 • edited Loading

zwzhang0107 commented Oct 17, 2023

zwzhang0107 commented Oct 17, 2023

hormes commented Oct 19, 2023

koordinator-bot bot commented Oct 19, 2023

saintube commented Oct 9, 2023 •

edited

Loading

codecov bot commented Oct 9, 2023 •

edited

Loading

saintube commented Oct 16, 2023 •

edited

Loading