Learn/Kubernetes/Autoscaling

Kubernetes

Autoscaling

15 articles from official documentation

Practitioner15 articles

kubernetesautoscalingPractitioner

Mastering Zonal Shift Support for EKS Auto Mode and Karpenter

Zonal Shift is a game-changer for maintaining application availability during zone failures in EKS. This feature automatically manages node provisioning and traffic routing, minimizing downtime. Dive in to understand how it works and what you need to implement it effectively.

→Enable Zonal Shift by setting the --enableZonalShift flag to true on the Karpenter controller.
→Use the command 'aws eks update-cluster-config --name <cluster-name> --region eu-north-1 --zonal-shift-config enabled=true' to activate Zonal Shift.

5 min read·AWS Containers Blog

kubernetesautoscalingPractitioner

Mastering Serverless with the Headlamp Plugin for Knative

Unlock the full potential of your Knative workloads with the Headlamp plugin. This tool allows you to manage KServices, traffic splitting, and autoscaling configurations all in one place. Dive in to see how it enhances your serverless experience.

→Utilize the Headlamp plugin to gain visibility into your Knative workloads.
→Manage KServices effectively with live editing capabilities for traffic splits and autoscaling.

5 min read·Kubernetes Blog

kubernetesautoscalingPractitioner

Unlocking EKS Auto Mode: Speed and Efficiency for Kubernetes Nodes

EKS Auto Mode is a game changer for Kubernetes scaling, delivering faster node management and smarter resource allocation. With Karpenter's 43% faster scale-out, you can optimize your cluster like never before.

→Leverage Karpenter for 43% faster scale-out in EKS Auto Mode.
→Increase `registryPullQPS` and `registryBurst` to 25 and 50 for optimal image pulling.

5 min read·AWS Containers Blog

kubernetesautoscalingPractitioner

Unlocking Efficiency: Amazon EKS Auto Mode Meets Istio Ambient Mesh

Streamline your Kubernetes workloads with the powerful combination of Amazon EKS Auto Mode and Istio Ambient Mesh. This integration automates node management while providing seamless mutual TLS encryption across your services. Discover how to leverage these technologies for enhanced security and performance.

→Automate node provisioning and scaling with Amazon EKS Auto Mode.
→Implement mutual TLS encryption seamlessly using Istio Ambient Mesh.

5 min read·AWS Containers Blog

kubernetesautoscalingPractitioner

Scaling StarRocks on EKS: Harnessing KEDA and Karpenter for OLAP Power

Unlock the full potential of your OLAP workloads with StarRocks on Amazon EKS. Learn how KEDA and Karpenter enable near-instant scaling of compute resources while maintaining a cost-effective shared-data architecture.

→Utilize KEDA for autoscaling BE and CN nodes based on Prometheus metrics.
→Deploy BE nodes as StatefulSets for durable storage using Amazon EBS volumes.

5 min read·AWS Containers Blog

kubernetesautoscalingPractitioner

Scaling StarRocks on EKS: Harnessing KEDA and Karpenter for OLAP Efficiency

In the world of enterprise OLAP workloads, scaling efficiently is crucial. By leveraging KEDA for autoscaling and Karpenter for node provisioning on Amazon EKS, you can dynamically adjust your StarRocks cluster to meet fluctuating query demands without data movement.

→Leverage KEDA for near-instant scaling of compute resources based on Prometheus metrics.
→Utilize the StarRocks Kubernetes Operator for automated cluster lifecycle management.

5 min read·AWS Containers Blog

kubernetesautoscalingPractitioner

GPU Autoscaling in Kubernetes: Mastering KEDA with External Scalers

Unlock the power of GPU autoscaling in Kubernetes with KEDA. Learn how to build a custom external scaler that reads GPU metrics via NVML and drives Horizontal Pod Autoscaler (HPA) decisions. This is essential for optimizing resource usage in GPU-heavy workloads.

→Build a custom DaemonSet to read GPU metrics using NVML.
→Serve GPU metrics over gRPC with KEDA's ExternalScaler interface.

5 min read·CNCF Blog

kubernetesautoscalingPractitioner

Unlocking Efficiency with Amazon EKS Auto Mode: Strategies for Control and Optimization

Amazon EKS Auto Mode is a game changer for Kubernetes management, automating everything from provisioning to patching. With just-in-time scaling, it dynamically adjusts resources based on workload demands, minimizing operational overhead.

→Automate cluster management to reduce operational overhead.
→Leverage just-in-time scaling to provision capacity based on workload demands.

5 min read·AWS Containers Blog

kubernetesautoscalingPractitioner

Kubernetes v1.36: Mastering In-Place Vertical Scaling for Pods

Kubernetes v1.36 introduces a game-changing feature: in-place vertical scaling for pod-level resources. This allows you to adjust resource budgets without container restarts, streamlining your operations. Dive into how this works and what you need to know to leverage it effectively.

→Utilize the InPlacePodLevelResourcesVerticalScaling feature to adjust resource budgets without restarts.
→Set the restartPolicy to NotRequired for minimal disruption during resource updates.

5 min read·Kubernetes Blog

kubernetesautoscalingPractitioner

KEDA in Action: Dynamic Autoscaling for Kubernetes

KEDA transforms how you scale applications in Kubernetes by responding to real-world events. With components like ScaledObjects and TriggerAuthentication, it offers a robust solution for dynamic resource management.

→Utilize ScaledObjects to link your app to external event sources for dynamic scaling.
→Configure TriggerAuthentication to securely access event sources with environment variables or cloud-specific credentials.

5 min read·Official Docs

kubernetesautoscalingPractitioner

Mastering In-Place Resizing of Kubernetes Container Resources

Need to adjust CPU and memory for your Kubernetes containers? Learn how to resize resources in place without downtime. Discover the critical role of the resizePolicy in managing container behavior during updates.

→Request a resize by updating the desired resources in the Pod's spec.
→Use `kubectl patch` with the `--subresource=resize` flag for in-place resizing.

5 min read·Kubernetes Docs

kubernetesautoscalingPractitioner

HPA in Production: What the Docs Don't Tell You

Scaling workloads in Kubernetes is crucial for performance and cost efficiency. The Horizontal Pod Autoscaler (HPA) automatically adjusts the number of pods based on CPU utilization, but there are nuances to consider. Dive into the specifics of HPA and how to avoid common pitfalls.

→Configure HPA with a target average CPU utilization of 50% for optimal performance.
→Set minimum and maximum replicas to control scaling behavior effectively.

5 min read·Kubernetes Docs

kubernetesautoscalingPractitioner

HPA in Production: What the Docs Don't Tell You

Horizontal Pod Autoscaling (HPA) is a game-changer for managing workloads in Kubernetes. It automatically scales your Pods to match demand, but there are critical nuances you need to grasp for effective implementation. Dive in to learn how to configure it properly and avoid common pitfalls.

→Configure the sync period with --horizontal-pod-autoscaler-sync-period to optimize responsiveness.
→Set resource requests for all containers to ensure accurate CPU utilization metrics.

5 min read·Kubernetes Docs

kubernetesautoscalingPractitioner

Mastering Autoscaling in Kubernetes: HPA, VPA, and Beyond

Autoscaling is crucial for maintaining application performance and resource efficiency in Kubernetes. With tools like HorizontalPodAutoscaler and VerticalPodAutoscaler, you can dynamically adjust your workloads based on real-time metrics. This article dives into how these components work and what you need to watch out for in production.

→Implement HorizontalPodAutoscaler to adjust replicas based on CPU or memory usage.
→Install the Metrics Server for VerticalPodAutoscaler to function correctly.

5 min read·Kubernetes Docs

kubernetesautoscalingPractitioner

AWS EKS Innovations: Powering Kubernetes at KubeCon EU 2026

AWS is pushing the boundaries of Kubernetes with innovations like the EKS Provisioned Control Plane and Seekable OCI Parallel Pull mode. These advancements promise to enhance performance and scalability, making it easier to manage large-scale workloads.

→Utilize Amazon EKS to manage up to 100K worker nodes in a single cluster for large-scale applications.
→Implement the EKS Provisioned Control Plane for predictable performance during peak demand.

5 min read·AWS Containers Blog

Better StackSponsor

Unified observability — logs, uptime monitoring, and on-call in one place. Used by 50,000+ engineering teams to ship faster and sleep better.

Try Better Stack free →

Get the daily digest

One email. 5 articles. Every morning.

No spam. Unsubscribe anytime.

Back to Kubernetes