Learn/Kubernetes/Scheduling & Resource Management

Kubernetes

Scheduling & Resource Management

15 articles from official documentation

Practitioner15 articles

kubernetesschedulingPractitioner

Mastering Workload-Aware Scheduling in Kubernetes v1.36

Kubernetes v1.36 introduces powerful workload-aware scheduling features that can transform how you deploy applications. With the new Workload and PodGroup APIs, you can prevent resource wastage and deadlocks through gang scheduling. This is a game changer for managing complex workloads effectively.

→Utilize the Workload API as a static template for better workload management.
→Implement the PodGroup API to define runtime objects for your applications.

5 min read·Kubernetes Blog

kubernetesschedulingPractitioner

Unlocking Kubernetes v1.36: PSI Metrics for Proactive Resource Management

Kubernetes v1.36 introduces Pressure Stall Information (PSI) metrics, a game changer for monitoring resource saturation. With cumulative totals and moving averages, you can now detect issues before they escalate into outages.

→Leverage PSI metrics to identify resource saturation before outages occur.
→Ensure your kernel is compiled with CONFIG_PSI=y to collect accurate PSI data.

5 min read·Kubernetes Blog

kubernetesschedulingPractitioner

Unlocking Kubernetes v1.36: Dynamic Resource Allocation and Its Game-Changing Features

Kubernetes v1.36 introduces Dynamic Resource Allocation (DRA), revolutionizing how you manage hardware accelerators. With features like prioritized lists and device taints, you can optimize resource utilization and improve system reliability.

→Leverage Dynamic Resource Allocation (DRA) to manage hardware accelerators effectively.
→Use prioritized lists to define fallback preferences for device requests.

5 min read·Kubernetes Blog

kubernetesschedulingPractitioner

Unlocking Performance with Kubernetes Pod-Level Resource Managers

Kubernetes v1.36 introduces Pod-Level Resource Managers, a game changer for performance-sensitive workloads. This feature allows for hybrid resource allocation models, enhancing efficiency without compromising NUMA alignment.

→Enable PodLevelResourceManagers and PodLevelResources feature gates for hybrid resource allocation.
→Configure the topology manager with a policy other than none for effective resource management.

5 min read·Kubernetes Blog

kubernetesschedulingPractitioner

Mastering Memory QoS in Kubernetes v1.36: Tiered Memory Protection Explained

Kubernetes v1.36 introduces Memory QoS, a game-changer for managing container memory. This feature leverages cgroup v2 to provide tiered memory protection, ensuring your critical workloads get the resources they need without starving others.

→Enable Memory QoS by setting featureGates.MemoryQoS to true in your kubelet configuration.
→Use memory.min for Guaranteed Pods to ensure hard memory protection.

5 min read·Kubernetes Blog

kubernetesschedulingPractitioner

Mastering Mutable Pod Resources in Suspended Kubernetes Jobs

Kubernetes v1.36 introduces a game-changing feature that allows you to modify resource requests and limits for suspended Jobs. This capability is crucial for optimizing resource allocation without disrupting running workloads. Learn how to leverage this in your production environment.

→Leverage the Mutable Pod Resources feature to adjust resource requests and limits for suspended Jobs.
→Use `spec.suspend: true` in your Job manifest to create a suspended Job.

5 min read·Kubernetes Blog

kubernetesschedulingPractitioner

Mastering Resource Bin Packing in Kubernetes Scheduling

Resource bin packing is crucial for optimizing resource utilization in Kubernetes. By leveraging strategies like MostAllocated and RequestedToCapacityRatio, you can ensure your pods are scheduled efficiently. Dive into the specifics of scoring strategies and real-world configurations.

→Understand the MostAllocated strategy to prioritize nodes with higher resource allocation.
→Utilize the RequestedToCapacityRatio to customize resource scoring based on your specific needs.

5 min read·Kubernetes Docs

kubernetesschedulingPractitioner

Mastering Resource Management for Kubernetes Pods

Effective resource management is crucial for Kubernetes performance and stability. Understanding how to set resource requests and limits can prevent resource contention and ensure your applications run smoothly. Dive into the specifics of CPU and memory management to optimize your deployments.

→Define resource requests to guide the kube-scheduler in Pod placement.
→Set resource limits to prevent containers from consuming excessive resources.

5 min read·Kubernetes Docs

kubernetesschedulingPractitioner

Mastering Node-Pressure Eviction in Kubernetes

Node-pressure eviction is crucial for maintaining cluster stability under resource constraints. When memory dips below a threshold, the kubelet proactively terminates pods to reclaim resources. Understanding this process can save you from unexpected downtime.

→Configure soft eviction thresholds to manage pod terminations gracefully.
→Set hard eviction thresholds to prevent critical resource exhaustion.

5 min read·Kubernetes Docs

kubernetesschedulingPractitioner

Mastering Pod Priority and Preemption in Kubernetes

Kubernetes scheduling can be a nightmare without proper prioritization. Pod Priority and Preemption allow you to control which workloads get resources first, ensuring critical applications run smoothly. Learn how to configure PriorityClasses and leverage preemption effectively.

→Define PriorityClasses to manage Pod scheduling effectively.
→Use the `preemptionPolicy` to control whether higher-priority Pods can evict lower-priority ones.

5 min read·Kubernetes Docs

kubernetesschedulingPractitioner

Mastering Kubernetes Scheduling: The Mechanics Behind Pod Placement

Kubernetes scheduling is crucial for efficient resource management in your clusters. Understanding how kube-scheduler filters and scores nodes can significantly impact your application's performance and reliability. Dive into the mechanics that drive effective pod placement.

→Understand kube-scheduler's role in the control plane for effective scheduling.
→Utilize filtering to identify feasible nodes based on resource availability.

5 min read·Kubernetes Docs

kubernetesschedulingPractitioner

Mastering Node Readiness Controller: Ensuring Node Health in Kubernetes

The Node Readiness Controller is a game changer for managing node readiness in Kubernetes. It uses NodeReadinessRule (NRR) to enforce custom health signals, ensuring workloads are only scheduled on fully prepared nodes. Dive in to learn how to implement this effectively in your cluster.

→Define NodeReadinessRule (NRR) to set custom readiness gates for your nodes.
→Choose between 'continuous enforcement' and 'bootstrap-only enforcement' based on your needs.

5 min read·Kubernetes Blog

kubernetesschedulingPractitioner

Mastering the Shift: cgroup v1 CPU Shares to v2 CPU Weight in Kubernetes

The transition from cgroup v1 to cgroup v2 is more than just a version upgrade; it fundamentally changes how CPU resources are allocated. With the introduction of a new conversion formula for CPU weight, understanding this shift is crucial for optimizing your Kubernetes workloads.

→Understand the new formula for CPU weight: $$cpu.weight = \lceil 10^{(L^{2}/612 + 125L/612 - 7/34)} \rceil$$.
→Recognize that cgroup v1 CPU shares are replaced by CPU weight in cgroup v2.

5 min read·Kubernetes Blog

kubernetesschedulingPractitioner

Unlocking Mutable PersistentVolume Node Affinity in Kubernetes v1.35

Kubernetes v1.35 introduces mutable PersistentVolume (PV) node affinity, allowing you to adjust node affinity dynamically. This flexibility can help optimize resource allocation and improve scheduling efficiency in your clusters.

→Enable the MutablePVNodeAffinity feature gate on the APIServer to use this functionality.
→Adjust PV node affinity dynamically to optimize resource allocation and scheduling.

5 min read·Kubernetes Blog

kubernetesschedulingPractitioner

Kubernetes v1.35: Elevate Scheduling with Extended Toleration Operators

Kubernetes v1.35 introduces Extended Toleration Operators, allowing for more nuanced scheduling decisions based on numeric comparisons. With the new Gt and Lt operators, you can now make threshold-based scheduling decisions that adapt to real-time metrics.

→Utilize Gt and Lt operators for threshold-based scheduling decisions.
→Ensure numeric values are positive 64-bit integers without leading zeros.

5 min read·Kubernetes Blog

Better StackSponsor

Unified observability — logs, uptime monitoring, and on-call in one place. Used by 50,000+ engineering teams to ship faster and sleep better.

Try Better Stack free →

Get the daily digest

One email. 5 articles. Every morning.

No spam. Unsubscribe anytime.

Back to Kubernetes