Track preserved failed machines in MCS and MCD status#1092
Merged
gardener-prow[bot] merged 12 commits intogardener:masterfrom Apr 27, 2026
Merged
Track preserved failed machines in MCS and MCD status#1092gardener-prow[bot] merged 12 commits intogardener:masterfrom
gardener-prow[bot] merged 12 commits intogardener:masterfrom
Conversation
takoverflow
reviewed
Apr 6, 2026
aaronfern
reviewed
Apr 7, 2026
| return totalAvailableReplicas | ||
| } | ||
|
|
||
| // GetPreservedFailedReplicaCountForMachineSets returns the number of available machines corresponding to the given machine sets. |
Member
There was a problem hiding this comment.
Docstring needs to be corrected
gagan16k
reviewed
Apr 10, 2026
3ef4b96 to
a47f9a5
Compare
r4mek
reviewed
Apr 20, 2026
Contributor
|
/lgtm |
|
LGTM label has been added. DetailsGit tree hash: b6085293a87bf4863ad9b9ac74d7330449a41f91 |
… until node joins cluster
* remove dwd warning * add upper limit to AutoPreserveFailedMachineMax * update de-preservation semantics followed - Update safety options default value for machinePreserveTimeout
2b3c146 to
e10292a
Compare
takoverflow
approved these changes
Apr 27, 2026
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: takoverflow The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
LGTM label has been added. DetailsGit tree hash: a15d532b63e1e166b69f04c18388a3251ced2c0d |
This was referenced Apr 28, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
What this PR does / why we need it:
This PR introduces 2 changes:
preservedFailedReplicasadded to the MCS and MCD Status.failedMachinein the MCS and MCD Status. This will also prevent shoot reconciliation from getting stuck.Additionally, the PR modifies the sorting logic for
ActiveMachines-- if two machines are preserved, one auto-preserved and one explicitly preserved through annotation by user/operator, the sorting logic de-prioritizes explicitly preserved machines for deletion.The usage doc for preservation has been updated with a warning regarding the behaviour of DWD when number of preserved failed machines exceeds the threshold set for DWD.
Which issue(s) this PR fixes:
Extends #1008
Special notes for your reviewer:
MCM unit and integration tests passed with changes.
The changes were manually tested out using virtual provider. Below log shows MCD status when it has 1 preserved failed replica.
Release note: