-
Notifications
You must be signed in to change notification settings - Fork 470
Add docs for leader-leaseholder splits #19755
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
✅ Deploy Preview for cockroachdb-interactivetutorials-docs canceled.
|
Files changed:
|
✅ Deploy Preview for cockroachdb-api-docs canceled.
|
hi @miraradeva i hope PR this captures the gist of our convo (which i copied to the linked docs ticket) but please let me know if i'm missing some important information, or if anything is incorrect |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Look great, thanks!
✅ Netlify Preview
To edit notification comments on pull requests, go to your Netlify project configuration. |
thanks @miraradeva ! I have made updates based on your feedback in 7c1d5cf, thank you for the review! i will move this along for docs team review to get it published ASAP |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, couple suggestions
@@ -160,6 +160,15 @@ Unlike table data, system ranges use expiration-based leases; expiration-based l | |||
|
|||
Expiration-based leases are also used temporarily during operations like lease transfers, until the new Raft leader can be fortified based on store liveness, as described in [Leader leases](#leader-leases). | |||
|
|||
#### Leader‑leaseholder splits | |||
|
|||
[Epoch-based leases](#epoch-based-leases) are vulnerable to _leader-leaseholder splits_. These can occur when a leaseholder’s Raft log has fallen behind and cannot acquire Raft leadership. Coupled with a [network partition]({% link {{ page.version.version }}/cluster-setup-troubleshooting.md %}#network-partition), this split can cause permanent unavailability of the range if (1) the stale leaseholder continues heartbeating the [liveness range](#liveness-range) to hold its lease but (2) cannot reach the leader to propose writes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Replace all smart quotes
[Epoch-based leases](#epoch-based-leases) are vulnerable to _leader-leaseholder splits_. These can occur when a leaseholder’s Raft log has fallen behind and cannot acquire Raft leadership. Coupled with a [network partition]({% link {{ page.version.version }}/cluster-setup-troubleshooting.md %}#network-partition), this split can cause permanent unavailability of the range if (1) the stale leaseholder continues heartbeating the [liveness range](#liveness-range) to hold its lease but (2) cannot reach the leader to propose writes. | |
[Epoch-based leases](#epoch-based-leases) are vulnerable to _leader-leaseholder splits_. These can occur when a leaseholder's Raft log has fallen behind and cannot acquire Raft leadership. Coupled with a [network partition]({% link {{ page.version.version }}/cluster-setup-troubleshooting.md %}#network-partition), this split can cause permanent unavailability of the range if (1) the stale leaseholder continues heartbeating the [liveness range](#liveness-range) to hold its lease but (2) cannot reach the leader to propose writes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I might just be tired at the moment, but "Raft log has fallen behind" might be unclear as to "what" the Raft log has fallen behind.
@@ -320,7 +320,7 @@ Send an alert when the number of ranges with replication below the replication f | |||
|
|||
### Requests stuck in raft |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
### Requests stuck in raft | |
### Requests stuck in Raft |
@@ -160,6 +160,15 @@ Unlike table data, system ranges use expiration-based leases; expiration-based l | |||
|
|||
Expiration-based leases are also used temporarily during operations like lease transfers, until the new Raft leader can be fortified based on store liveness, as described in [Leader leases](#leader-leases). | |||
|
|||
#### Leader‑leaseholder splits | |||
|
|||
[Epoch-based leases](#epoch-based-leases) are vulnerable to _leader-leaseholder splits_. These can occur when a leaseholder’s Raft log has fallen behind and cannot acquire Raft leadership. Coupled with a [network partition]({% link {{ page.version.version }}/cluster-setup-troubleshooting.md %}#network-partition), this split can cause permanent unavailability of the range if (1) the stale leaseholder continues heartbeating the [liveness range](#liveness-range) to hold its lease but (2) cannot reach the leader to propose writes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this correct / worth adding?
[Epoch-based leases](#epoch-based-leases) are vulnerable to _leader-leaseholder splits_. These can occur when a leaseholder’s Raft log has fallen behind and cannot acquire Raft leadership. Coupled with a [network partition]({% link {{ page.version.version }}/cluster-setup-troubleshooting.md %}#network-partition), this split can cause permanent unavailability of the range if (1) the stale leaseholder continues heartbeating the [liveness range](#liveness-range) to hold its lease but (2) cannot reach the leader to propose writes. | |
[Epoch-based leases](#epoch-based-leases) (unlike [Leader leases](#leader-leases)) are vulnerable to _leader-leaseholder splits_. These can occur when a leaseholder’s Raft log has fallen behind and cannot acquire Raft leadership. Coupled with a [network partition]({% link {{ page.version.version }}/cluster-setup-troubleshooting.md %}#network-partition), this split can cause permanent unavailability of the range if (1) the stale leaseholder continues heartbeating the [liveness range](#liveness-range) to hold its lease but (2) cannot reach the leader to propose writes. |
Fixes DOC-13920