pageserver: guard against WAL gaps in the interpreted protocol #10858

VladLazar · 2025-02-17T17:18:36Z

Problem

The interpreted SK <-> PS protocol does not guard against gaps (neither does the Vanilla one, but that's beside the point).

Summary of changes

Extend the protocol to include the start LSN of the PG WAL section from which the records were interpreted.
Validation is enabled via a config flag on the pageserver and works as follows:

Case 1: raw_wal_start_lsn is smaller than the requested LSN
There can't be gaps here, but we check that the shard received records which it hasn't seen before.

Case 2: raw_wal_start_lsn is equal to the requested LSN
This is the happy case. No gap and nothing to check

Case 3: raw_wal_start_lsn is greater than the requested LSN
This is a gap.

To make Case 3 work I had to bend the protocol a bit.
We read record chunks of WAL which aren't record aligned and feed them to the decoder.
The picture below shows a shard which subscribes at a position somewhere within Record 2.
We already have a wal reader which is below that position so we wait to catch up.
We read some wal in Read 1 (all of Record 1 and some of Record 2). The new shard doesn't
need Record 1 (it has already processed it according to the starting position), but we read
past it's starting position. When we do Read 2, we decode Record 2 and ship it off to the shard,
but the starting position of Read 2 is greater than the starting position the shard requested.
This looks like a gap.

To make it work, we extend the protocol to send an empty InterpretedWalRecords to shards
if the WAL the records originated from ends the requested start position. On the pageserver,
that just updates the tracking LSNs in memory (no-op really). This gives us a workaround for
the fake gap.

As a drive by, make InterpretedWalRecords::next_record_lsn mandatory in the application level definition.
It's always included.

Related: https://github.com/neondatabase/cloud/issues/23935

github-actions · 2025-02-17T18:48:12Z

7557 tests run: 7184 passed, 0 failed, 373 skipped (full report)

Flaky tests (1)

Postgres 17

test_migration_to_cold_secondary: release-arm64-without-lfc

Code coverage* (full report)

functions: 32.9% (8625 of 26199 functions)
lines: 48.9% (72771 of 148957 lines)

* collected from Rust tests only

_{The comment gets automatically updated with the latest test results
c642939 at 2025-02-20T15:52:13.121Z :recycle:}

Add the start LSN of the PG WAL from which the batch of records originated. New pageservers can read messages from old safekeepers, the field will be None. Old pageserver can read messages from new safekeepers. The field is ignored.

This field is always included with every batch of interpreted records. It's an optional field in the proto definition, but it doesn't have to be optional in the application level definition.

libs/wal_decoder/src/models.rs

libs/wal_decoder/proto/interpreted_wal.proto

pageserver/src/bin/pageserver.rs

pageserver/src/tenant/timeline/walreceiver/walreceiver_connection.rs

safekeeper/src/send_interpreted_wal.rs

VladLazar requested a review from a team as a code owner February 17, 2025 17:18

VladLazar requested a review from arssher February 17, 2025 17:18

VladLazar marked this pull request as draft February 17, 2025 17:19

VladLazar removed the request for review from arssher February 17, 2025 17:22

VladLazar changed the title ~~Vlad/ps check wal contiguity~~ pageserver: guard against WAL gaps in the interpreted protocol Feb 17, 2025

VladLazar force-pushed the vlad/ps-check-wal-contiguity branch from d2c1547 to 8497de9 Compare February 17, 2025 17:27

VladLazar requested review from erikgrinaker and removed request for a team February 20, 2025 10:29

VladLazar marked this pull request as ready for review February 20, 2025 10:29

VladLazar added 8 commits February 20, 2025 11:34

safekeeper: make next_record_lsn mandatory

3b525f1

This field is always included with every batch of interpreted records. It's an optional field in the proto definition, but it doesn't have to be optional in the application level definition.

pageserver: guard against WAL gaps

8624424

fixup: handle no-op WAL chunks

7bf6bdd

fixup: relax the check a bit

4a6678d

fixup: check against the last last record LSN

90917ce

fixup: flip condition

217db91

fixup: treat fake gaps of WAL which doesn't decode yet

138a21b

VladLazar force-pushed the vlad/ps-check-wal-contiguity branch from c612bd3 to 123ccac Compare February 20, 2025 10:34

pageserver: hide validation behind a flag

66e84d0

VladLazar force-pushed the vlad/ps-check-wal-contiguity branch from 123ccac to 66e84d0 Compare February 20, 2025 11:43

erikgrinaker approved these changes Feb 20, 2025

View reviewed changes

VladLazar added 3 commits February 20, 2025 15:55

tests: temporarily skip contiguity checking in compat tests

51d3505

tests: remove concurrent IO skip for compat

7a366ae

fixup: grammar

c642939

VladLazar added this pull request to the merge queue Feb 20, 2025

Merged via the queue into main with commit 3499641 Feb 20, 2025
90 checks passed

VladLazar deleted the vlad/ps-check-wal-contiguity branch February 20, 2025 17:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pageserver: guard against WAL gaps in the interpreted protocol #10858

pageserver: guard against WAL gaps in the interpreted protocol #10858

VladLazar commented Feb 17, 2025 •

edited

Loading

github-actions bot commented Feb 17, 2025 •

edited

Loading

Postgres 17

pageserver: guard against WAL gaps in the interpreted protocol #10858

pageserver: guard against WAL gaps in the interpreted protocol #10858

Conversation

VladLazar commented Feb 17, 2025 • edited Loading

Problem

Summary of changes

github-actions bot commented Feb 17, 2025 • edited Loading

7557 tests run: 7184 passed, 0 failed, 373 skipped (full report)

Postgres 17

Code coverage* (full report)

VladLazar commented Feb 17, 2025 •

edited

Loading

github-actions bot commented Feb 17, 2025 •

edited

Loading