Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(storcon): chaos injection of force exit #10934

Merged
merged 3 commits into from
Feb 24, 2025
Merged

feat(storcon): chaos injection of force exit #10934

merged 3 commits into from
Feb 24, 2025

Conversation

skyzh
Copy link
Member

@skyzh skyzh commented Feb 21, 2025

Problem

close /~https://github.com/neondatabase/cloud/issues/24485

Summary of changes

This patch adds a new chaos injection mode for the storcon. The chaos injector reads the crontab and exits immediately at the configured time.

Signed-off-by: Alex Chi Z <chi@neon.tech>
@skyzh skyzh requested a review from jcsp February 21, 2025 17:08
@skyzh skyzh requested a review from a team as a code owner February 21, 2025 17:08
@jcsp
Copy link
Contributor

jcsp commented Feb 21, 2025

Seems reasonable (I guess this was easier than maintaining external cronjob?) -- let's make sure devprod are aligned on the time of day to do this + make it CLI-configurable.

@skyzh
Copy link
Member Author

skyzh commented Feb 21, 2025

Yes, cronjob needs some complex permission setup in kubernetes to allow a job to manipulate the cluster and I think it will take longer than this approach :(

Copy link

github-actions bot commented Feb 21, 2025

7601 tests run: 7225 passed, 0 failed, 376 skipped (full report)


Flaky tests (3)

Postgres 17

Code coverage* (full report)

  • functions: 32.8% (8624 of 26278 functions)
  • lines: 48.7% (72609 of 149192 lines)

* collected from Rust tests only


The comment gets automatically updated with the latest test results
384493f at 2025-02-24T05:44:54.692Z :recycle:

@skyzh skyzh changed the title [DNM] feat(storcon): chaos injection of force exit feat(storcon): chaos injection of force exit Feb 24, 2025
Signed-off-by: Alex Chi Z <chi@neon.tech>
@skyzh skyzh force-pushed the skyzh/storcon-chaos branch from d9d1b6f to 2608bc2 Compare February 24, 2025 03:31
Signed-off-by: Alex Chi Z <chi@neon.tech>
@skyzh
Copy link
Member Author

skyzh commented Feb 24, 2025

ready for review :) The next step: discuss the injection time, and add support to helm-charts + PR to the infra repo

@skyzh skyzh added this pull request to the merge queue Feb 24, 2025
Merged via the queue into main with commit 5fad4a4 Feb 24, 2025
90 checks passed
@skyzh skyzh deleted the skyzh/storcon-chaos branch February 24, 2025 15:43
github-merge-queue bot pushed a commit that referenced this pull request Feb 28, 2025
## Problem

Somehow the previous patch loses the loop in the chaos injector function
so everything will only run once.
#10934

## Summary of changes

Add back the loop.

Signed-off-by: Alex Chi Z <chi@neon.tech>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants