kubernetesobservabilityPractitioner

Mastering the Kubernetes Resource Metrics Pipeline

5 min read Kubernetes DocsApr 23, 2026Reviewed for accuracy

Practitioner — Hands-on experience recommended

The Resource Metrics Pipeline exists to enable efficient autoscaling in Kubernetes. By providing real-time CPU and memory metrics, it allows workloads to automatically adjust based on demand. This is crucial for maintaining performance and resource efficiency in dynamic environments.

At the core of this pipeline is the metrics-server, which collects resource metrics from each kubelet in the cluster. It queries nodes over HTTP, aggregates the data, and exposes it through the Metrics API. This API is then utilized by Horizontal Pod Autoscaler (HPA) and Vertical Pod Autoscaler (VPA) to scale workloads appropriately. For instance, you can retrieve node metrics using commands like kubectl get --raw "/apis/metrics.k8s.io/v1beta1/nodes/minikube" | jq '.' to see real-time resource usage.

In production, you must ensure that the metrics-server is deployed and that the API aggregation layer is enabled. Be aware that the Metrics API only provides basic CPU and memory metrics, which might not be sufficient for all scaling needs. Additionally, if you're using a container runtime that doesn't support cgroups, you may run into issues with metric availability. Always check for compatibility with your specific setup to avoid surprises.

Key takeaways

→Deploy the metrics-server to access the Metrics API.
→Use the Metrics API for real-time CPU and memory metrics.
→Utilize HPA and VPA for efficient workload scaling based on metrics.
→Ensure your container runtime supports cgroups for accurate metrics.
→Enable the API aggregation layer to use the metrics.k8s.io API.

Why it matters

In production, effective autoscaling can lead to significant cost savings and improved application performance. By leveraging the Resource Metrics Pipeline, you ensure your workloads adapt to real-time demands, optimizing resource utilization.

Code examples

Bash

kubectl get --raw"/apis/metrics.k8s.io/v1beta1/nodes/minikube"| jq'.'

Bash

curl http://localhost:8080/apis/metrics.k8s.io/v1beta1/nodes/minikube

Bash

kubectl get --raw"/apis/metrics.k8s.io/v1beta1/namespaces/kube-system/pods/kube-scheduler-minikube"| jq'.'

When NOT to use this

The official docs don't call out specific anti-patterns here. Use your judgment based on your scale and requirements.

Want the complete reference?

Read official docs

Test what you just learned

Quiz questions written from this article

Take the quiz →

Better StackSponsor

Unified observability — logs, uptime monitoring, and on-call in one place. Used by 50,000+ engineering teams to ship faster and sleep better.

Try Better Stack free →

Mastering the Kubernetes Resource Metrics Pipeline

Key takeaways

Why it matters

Code examples

When NOT to use this

More on this topic

Dynamic Configuration for Cloud Native Swift Services in Kubernetes

Understanding the Kubernetes Integration Tax: Navigating Prometheus and Cilium in Production

Tracing AI Agents: Jaeger's Evolution with OpenTelemetry