Tune ardac1prd Karpenter disruption settings #63

Merged: esentuna merged 1 commit into main from codex/karpenter-disruption-tuning on Jun 4, 2026
@alan-walsh

Summary

  • reduce default Karpenter voluntary disruption by changing ardac1prd default consolidation to WhenEmpty
  • exclude nano, micro, and small instance sizes from the default Karpenter NodePool
  • enable Gen3 PDBs and hard per-node topology spread for supported multi-replica services

Validation

  • helm lint passed for Gen3 chart with ardac1prd portal values
  • helm lint passed for cluster-level-resources chart with ardac1prd cluster values
  • helm template confirmed PDB minAvailable 1 and DoNotSchedule hostname topology spread
  • helm template confirmed default NodePool renders instance-size NotIn nano micro small and WhenEmpty consolidation
  • git diff --check passed
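For reference, the NodePool fragment that the helm template check looks for would resemble the sketch below. It is an illustration, not the chart's actual output: the `karpenter.sh/v1` API version, the `default` name, and the `consolidateAfter` value are assumptions, and required fields such as `nodeClassRef` are omitted for brevity.

```yaml
# Illustrative fragment only; not copied from the rendered chart.
apiVersion: karpenter.sh/v1          # assumed API version
kind: NodePool
metadata:
  name: default                      # assumed name of the default NodePool
spec:
  template:
    spec:
      requirements:
        # Exclude the smallest instance sizes from provisioning.
        - key: karpenter.k8s.aws/instance-size
          operator: NotIn
          values: ["nano", "micro", "small"]
      # nodeClassRef and other required fields omitted here.
  disruption:
    # Only consolidate nodes that have no workload pods left on them.
    consolidationPolicy: WhenEmpty
    consolidateAfter: 30s            # assumed value; not stated in this PR
```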

Notes

  • yamllint still reports pre-existing CRLF line endings and comment-spacing warnings in the edited YAML files; this PR avoids broad formatting churn

Copilot AI left a comment

Pull request overview

This PR adjusts the production (ardac1prd) configuration to reduce Karpenter voluntary disruption and to improve workload resilience during node maintenance and consolidation. It does so by enabling PodDisruptionBudgets (PDBs) and enforcing per-node topology spread where the Gen3 chart supports it.

Changes:

  • Enable Gen3 PDBs in ardac1prd portal values with minAvailable: 1, and configure hostname-based topology spread with DoNotSchedule.
  • Reduce default Karpenter consolidation aggressiveness by switching consolidation policy to WhenEmpty.
  • Exclude nano, micro, and small EC2 instance sizes from the default Karpenter NodePool requirements.
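As a rough illustration of the first change, the rendered Kubernetes objects for one service might look like the sketch below. The service name and `app` label are hypothetical, `maxSkew: 1` is an assumed value, and the real manifests come from the Gen3 chart templates, which may differ in naming and selectors.

```yaml
# Illustrative sketch; "example-service" and its labels are hypothetical.
apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: example-service-pdb
spec:
  # Keep at least one replica running during voluntary disruptions.
  minAvailable: 1
  selector:
    matchLabels:
      app: example-service
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: example-service
spec:
  replicas: 2
  selector:
    matchLabels:
      app: example-service
  template:
    metadata:
      labels:
        app: example-service
    spec:
      topologySpreadConstraints:
        # Hard constraint: spread replicas across nodes by hostname.
        - maxSkew: 1                       # assumed value
          topologyKey: kubernetes.io/hostname
          whenUnsatisfiable: DoNotSchedule
          labelSelector:
            matchLabels:
              app: example-service
      containers:
        - name: example-service
          image: example/image:tag         # placeholder image
```

With `DoNotSchedule`, a replica that cannot satisfy the spread stays Pending rather than landing on a node that already hosts one, so the constraint trades schedulability for isolation.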

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| ardac1prd/portal.ardac.org/values.yaml | Enables PDBs and configures hostname topology spread defaults for supported services. |
| ardac1prd/cluster-level-resources/cluster-values.yaml | Tunes default Karpenter disruption/consolidation and filters out small instance sizes. |

@esentuna esentuna merged commit 8d500c9 into main Jun 4, 2026
2 checks passed

3 participants