
feat: Initial support for cooperative-sticky rebalancing #407

Merged: 6 commits into main from incremental-rebalancing on Dec 17, 2024

Conversation

untitaker (Member):

Fix one bug in StreamProcessor where it assumed the passed assignments replace the old ones.

Our consumer backends mostly work as-is and already pass the right values in their callbacks.

untitaker requested review from a team as code owners on December 13, 2024 at 01:06.

lynnagara (Member) left a comment:

Looks good!

Were you planning to merge + publish as-is, or do more testing against this branch?

Is it worth including a test that fails against the current cluster but works with newer versions? Attempting to commit on an existing partition during a rebalance might do it.

arroyo/processing/processor.py (outdated review thread, resolved)
@@ -245,6 +256,10 @@ def test_consumer_polls_when_paused(self) -> None:
assert consumer.paused() == []


class TestKafkaStreamsIncrementalRebalancing(TestKafkaStreams):

Member:
unused?

untitaker (Member Author):

no, this actually re-declares all the tests in TestKafkaStreams, just re-running them with cooperative-sticky rebalancing
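
For illustration, a sketch of that subclassing pattern; the get_consumer_config helper is assumed for the example, not the real fixture:

```python
# Illustrative sketch only. Because the subclass inherits every test method
# and only overrides the configuration, pytest collects and runs the whole
# TestKafkaStreams suite a second time with the cooperative assignor enabled.
class TestKafkaStreams:
    def get_consumer_config(self) -> dict:
        return {"group.id": "test-group", "auto.offset.reset": "earliest"}

    def test_consumer_polls_when_paused(self) -> None:
        ...  # builds a consumer from get_consumer_config() and exercises it


class TestKafkaStreamsIncrementalRebalancing(TestKafkaStreams):
    def get_consumer_config(self) -> dict:
        return {
            **super().get_consumer_config(),
            "partition.assignment.strategy": "cooperative-sticky",
        }
```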

# Second partition assigned
offsets_p1 = {Partition(topic, 1): 0}
assignment_callback(offsets_p1)

create_args, _ = factory.create_with_partitions.call_args
assert factory.create_with_partitions.call_count == 2
- assert create_args[1] == offsets_p1
+ assert create_args[1] == {**offsets_p1, **offsets_p0}

Member:
was this test change related to your other changes? since there's no cooperative rebalancing here, seems like the assertions should stay the same?

untitaker (Member Author):

the mocked return value for consumer.tell was wrong, so this had the wrong value. the assignments in this test are actually incremental: first p1 is assigned, then p0, and there's no revocation.

@@ -161,6 +161,7 @@ def __init__(
)

configuration = dict(configuration)
self.__assignment_strategy = configuration.get("partition.assignment.strategy")

Member:

sorry i said the wrong thing earlier, this should be group.protocol

untitaker (Member Author):

after discussing offline i think we can support KIP-848 (group.protocol) as well as cooperative-sticky rebalancing. they're the same as far as rdkafka API is concerned. i just can't get it to work right now and might scope it out of this PR if it takes too much time.
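
For reference, a rough sketch of the two configuration knobs being compared, written against confluent-kafka directly; whether a given librdkafka/broker combination supports KIP-848 is an assumption to verify:

```python
from confluent_kafka import Consumer

# Cooperative incremental rebalancing on the classic group protocol:
cooperative = Consumer(
    {
        "bootstrap.servers": "localhost:9092",
        "group.id": "my-group",
        "partition.assignment.strategy": "cooperative-sticky",
    }
)

# KIP-848 consumer group protocol; partition assignment is negotiated through
# the new protocol, so no client-side assignment strategy is set here:
kip848 = Consumer(
    {
        "bootstrap.servers": "localhost:9092",
        "group.id": "my-group",
        "group.protocol": "consumer",
    }
)
```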

untitaker (Member Author):

> Is it worth including a test that fails against the current cluster but works with newer versions? Attempting to commit on an existing partition during a rebalance might do it.

can you elaborate on this?

Comment on lines +274 to +275
logger.info("skipping empty assignment")
return

Contributor:

Why do you need different logic for partition assignment between cooperative and standard rebalancing in the case of an empty assignment?
I assume you can get an empty assignment with cooperative rebalancing when, after a rebalance, your assignments do not change. Is that the scenario where you do not want to touch the existing assignments?

untitaker (Member Author):

after sleeping on it i agree. i only added this because it made cooperative rebalancing more comprehensible, and wasn't sure of the implications on regular rebalancing. i think we can skip empty assignments regardless of the assignment strategy.
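
A minimal sketch of that direction, with illustrative names rather than the actual callback:

```python
import logging
from typing import Mapping

logger = logging.getLogger(__name__)


# Hypothetical callback sketch: skip empty assignments up front, independent
# of the assignment strategy, instead of special-casing the cooperative path.
def on_partitions_assigned(offsets: Mapping[str, int]) -> None:
    if not offsets:
        # Nothing new was assigned; leave the current assignment untouched.
        logger.info("skipping empty assignment")
        return

    # ...otherwise merge the new partitions into the running assignment and
    # (re)build the processing strategy for them.
```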

@@ -107,7 +107,7 @@ def test_dlq_policy_wrapper() -> None:
)
partition = Partition(topic, 0)
wrapper = DlqPolicyWrapper(dlq_policy)
- wrapper.reset_offsets({partition: 0})
+ wrapper.reset_dlq_limits({partition: 0})

untitaker (Member Author):

this rename is just to align with rust btw

untitaker merged commit 8ba2e54 into main on Dec 17, 2024 (14 checks passed).
untitaker deleted the incremental-rebalancing branch on December 17, 2024 at 00:27.