
PS/Prefetch: Use a timeout for reading data from TCP #10834

Merged: 4 commits merged into main from MMeent/timeout-based-prefetch-queue-draining, Feb 27, 2025

Conversation

@MMeent (Contributor) commented Feb 14, 2025

This reduces pressure on OS TCP buffers, reducing flush times in other systems like PageServer.

Problem

Prefetched getpage responses can sit unread in the OS TCP buffers, causing TCP backpressure and long flush times in upstream systems like PageServer.

Summary of changes

Use a timeout to periodically pull getpage responses from the TCP connection while getpage requests are outstanding.


github-actions bot commented Feb 14, 2025

7744 tests run: 7366 passed, 0 failed, 378 skipped (full report)


Code coverage* (full report)

  • functions: 32.8% (8642 of 26359 functions)
  • lines: 48.7% (73209 of 150476 lines)

* collected from Rust tests only


This comment is automatically updated with the latest test results; last updated for commit e456cc9 at 2025-02-27T13:31:47.712Z.

@MMeent force-pushed the MMeent/timeout-based-prefetch-queue-draining branch from 65b6124 to cce5d5d on February 17, 2025 16:31

If this PR added a GUC in the Postgres fork or neon extension,
please regenerate the Postgres settings in the cloud repo:

make NEON_WORKDIR=path/to/neon/checkout \
  -C goapp/internal/shareddomain/postgres generate

If you're an external contributor, a Neon employee will assist in
making sure this step is done.

@MMeent marked this pull request as ready for review on February 20, 2025 15:22
@MMeent requested review from a team as code owners on February 20, 2025 15:22
@MMeent force-pushed the MMeent/timeout-based-prefetch-queue-draining branch from f94533c to f24e99e on February 20, 2025 15:26
@MMeent enabled auto-merge on February 20, 2025 23:19
@MMeent force-pushed the MMeent/timeout-based-prefetch-queue-draining branch 2 times, most recently from 6db9ede to 93ab153 on February 21, 2025 16:38
@skyzh (Member) left a comment


LGTM from the storage side

@MMeent force-pushed the MMeent/timeout-based-prefetch-queue-draining branch 2 times, most recently from 0caec61 to b86b759 on February 21, 2025 17:07
@MMeent requested a review from hlinnaka on February 21, 2025 17:09
@MMeent force-pushed the MMeent/timeout-based-prefetch-queue-draining branch from b86b759 to 7214004 on February 25, 2025 15:30
@hlinnaka (Contributor) commented:

This still makes me pretty squeamish. Are you sure we can't "re-enter" some parts of the prefetching code? The readpage_reentrant_guard is supposed to prevent that, but I can't convince myself that it's enough and that it's used in the right places. I don't see any direct bug or issue either, though.

For example, can an interrupt happen while we're in a prefetch_register_bufferv() call, and is that OK? I don't see any CHECK_FOR_INTERRUPTS() calls there, but it's a long function, so it's hard to tell at a quick glance.

Does the guard protect some specific fields of MyPState?
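
(For orientation: a minimal sketch of the guard pattern being discussed, assuming it wraps the interrupt-time pump. readpage_reentrant_guard, prefetch_pump_state, and pagestore_smgr_processinterrupts are named in this thread; the body is illustrative, not the actual neon code.)

    #include "postgres.h"

    static bool readpage_reentrant_guard = false;

    /* Sketch only: block nested entry into prefetch code when an
     * interrupt fires while prefetch code is already on the stack. */
    static void
    pagestore_smgr_processinterrupts(void)
    {
        if (readpage_reentrant_guard)
            return;                 /* already inside prefetch code */

        readpage_reentrant_guard = true;
        PG_TRY();
        {
            prefetch_pump_state();  /* drain completed getpage responses */
        }
        PG_FINALLY();
        {
            readpage_reentrant_guard = false;
        }
        PG_END_TRY();
    }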

@hlinnaka (Contributor) commented:

prefetch_pump_state has this comment:

 * Note that this works because we don't pipeline non-getPage requests.

It took me a while to understand, but I got it now:

smgrnblocks()
 -> neon_nblocks()
   -> page_server_request()
     -> page_server->receive()
       -> pageserver_receive()
         -> call_PQgetCopyData()
           -> CHECK_FOR_INTERRUPTS()
             -> pagestore_smgr_processinterrupts()
               -> prefetch_pump_state()
                 -> page_server->try_receive()
                   -> PQgetCopyData()

The nested call above cannot actually happen, because prefetch_pump_state() would find that there are no in-flight requests in the ring and return early.

So that's OK, although it feels a bit fragile.
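
(A sketch of the early exit that makes the nested call a no-op; the ring-index fields on MyPState are hypothetical names, not the real struct layout.)

    /* Sketch only: field names are hypothetical. */
    static void
    prefetch_pump_state(void)
    {
        /*
         * With no getpage requests in flight there is nothing to read,
         * so a nested call returns here and never reaches
         * PQgetCopyData().
         */
        if (MyPState->ring_receive == MyPState->ring_unused)
            return;

        /* ... otherwise try_receive() responses without blocking ... */
    }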

@hlinnaka (Contributor) left a comment


I don't see a concrete problem with this, even though I'm a bit squeamish.

Let's add a GUC for this, though. I'd love to have this in staging and pre-prod for a while, and perhaps do a slow rollout to production too, starting with the endpoints that have experienced problems with the socket buffers filling up.

I think a simple check in reconfigure_timeout_if_needed to do nothing if the GUC is not set would do the trick. Or, since we're adding a GUC anyway, perhaps make it an integer GUC that replaces the PS_BACKGROUND_DELAY constant. Then we can also experiment with making it more or less aggressive.
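
(A hedged sketch of that integer GUC using Postgres's DefineCustomIntVariable. The GUC name and its limits match the commits that follow; everything else is illustrative, not the merged code.)

    #include "postgres.h"
    #include "utils/guc.h"

    static int readahead_getpage_pull_timeout_ms = 100;

    void
    _PG_init(void)
    {
        DefineCustomIntVariable("neon.readahead_getpage_pull_timeout",
                                "Delay before prefetched getpage responses "
                                "are pulled from the TCP connection",
                                NULL,
                                &readahead_getpage_pull_timeout_ms,
                                100,            /* default: 100 ms */
                                0,              /* 0 disables the timeout */
                                5 * 60 * 1000,  /* capped at 5 minutes */
                                PGC_USERSET,
                                GUC_UNIT_MS,
                                NULL, NULL, NULL);
    }

    static void
    reconfigure_timeout_if_needed(void)
    {
        /* Per the suggestion above: do nothing when disabled. */
        if (readahead_getpage_pull_timeout_ms == 0)
            return;

        /* ... otherwise (re)arm the pull timeout ... */
    }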

Commit messages from the branch:

This reduces pressure on OS TCP buffers, reducing TCP backpressure and therefore flush times in upstream systems like PageServer.

The timeout triggers every 100 ms (compile-time constant) if there are any outstanding getpage requests.

That's better than a built-in constant, as it allows us to better control the behaviour of this feature.
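
(A sketch of how such a periodic pull can be wired up with Postgres's timeout API, RegisterTimeout and enable_timeout_after; the handler and flag names here are hypothetical.)

    #include "postgres.h"
    #include <signal.h>
    #include "miscadmin.h"
    #include "utils/timeout.h"

    static TimeoutId pull_timeout_id;
    static bool pull_timeout_registered = false;
    static volatile sig_atomic_t pull_pending = false;

    /* Runs in signal context: only set flags here; the socket is
     * drained later, during CHECK_FOR_INTERRUPTS() processing. */
    static void
    pull_timeout_handler(void)
    {
        pull_pending = true;
        InterruptPending = true;
    }

    static void
    arm_pull_timeout(void)
    {
        if (!pull_timeout_registered)
        {
            pull_timeout_id = RegisterTimeout(USER_TIMEOUT,
                                              pull_timeout_handler);
            pull_timeout_registered = true;
        }
        /* Fire 100 ms from now while getpage requests are outstanding. */
        enable_timeout_after(pull_timeout_id, 100);
    }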
@MMeent force-pushed the MMeent/timeout-based-prefetch-queue-draining branch from 7214004 to db031d7 on February 26, 2025 14:55
@MMeent (Contributor, Author) commented Feb 26, 2025

@hlinnaka Did you mean something like this latest version? (i.e. commit 2 of 3)

- Adjust comment on readahead_getpage_pull_timeout_ms
- Use 0 for disabling the timeout, not both 0 and -1
- Limit the timeout to 5 minutes
@MMeent added this pull request to the merge queue on Feb 27, 2025
Merged via the queue into main with commit a283eda on Feb 27, 2025
91 checks passed
@MMeent deleted the MMeent/timeout-based-prefetch-queue-draining branch on February 27, 2025 14:01