
Release Decoded: Kubernetes Release v1.34

Kubernetes Release: v1.34

Release Date: 08/27/2025



Kubernetes 1.34 arrives on August 27, 2025, carrying the theme of stability with progress. This release strikes a careful balance between strengthening the platform’s reliability and introducing meaningful new capabilities. Importantly, it contains no API removals or deprecations, which means enterprises can upgrade without facing breaking changes.

For production environments, Kubernetes 1.34 focuses on three major areas: smarter handling of specialized hardware, more secure and automated registry authentication, and greater control over rollout strategies. Dynamic Resource Allocation graduates to Stable, enabling efficient use of GPUs and other accelerators. ServiceAccount tokens for image pulls simplify private registry access with stronger security. A new Deployment Pod Replacement Policy, now in alpha, gives operators flexibility to prioritize either rollout speed or resource stability.

These highlights, along with quality-of-life updates across observability, autoscaling, and configuration, make v1.34 a release that enterprises can adopt with confidence. Let’s explore how these improvements strengthen Kubernetes as the foundation for modern workloads.


Dynamic Resource Allocation (Stable): Smarter Hardware Management

What It Is

Kubernetes 1.34 graduates Dynamic Resource Allocation (DRA) to Stable, making it the default framework for managing specialized hardware such as GPUs, FPGAs, and high-performance NICs. DRA introduces new API objects under the resource.k8s.io API group, including ResourceClaim, DeviceClass, ResourceClaimTemplate, and ResourceSlice. Together, these provide a consistent way to define, request, and allocate devices across a cluster.


High-Level Workflow

Cluster administrators define DeviceClasses that describe available hardware types, such as a specific GPU model. Developers can then request devices directly in Pod specifications by referencing these classes. Kubernetes automatically handles scheduling, ensuring Pods land on nodes that match the requirements.

For example, a Pod requesting a MIG slice through a ResourceClaimTemplate might look like this:

---
apiVersion: v1
kind: Pod
metadata:
  name: gpu-fractional-job
spec:
  containers:
  - name: trainer
    image: nvidia/cuda:11.8-base
    resources:
      claims:
      - name: gpu                          # consume the Pod-level claim defined below
  resourceClaims:
  - name: gpu
    resourceClaimTemplateName: mig-claim-template   # created by the cluster administrator

How it works

  • The cluster administrator defines a DeviceClass that selects MIG partitions (for example, 1g.5gb slices) instead of whole GPUs, along with a ResourceClaimTemplate that references it (see the sketch below).

  • When a Pod requests one slice, Kubernetes allocates a partition of a GPU through DRA.

  • Multiple Pods can now share a single physical GPU, each consuming only the partition they need.

This workflow shows how fractional allocation reduces waste. Instead of locking up an entire GPU for a small training job, Kubernetes ensures workloads get precisely the resources required, while other jobs can run on the remaining partitions.
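
For completeness, the administrator-side objects referenced above might look roughly like the following sketch. The DeviceClass name, driver identifier, and CEL selector expression are illustrative, and the ResourceClaimTemplate field layout should be checked against the resource.k8s.io/v1 API reference for your cluster before use.

---
apiVersion: resource.k8s.io/v1
kind: DeviceClass
metadata:
  name: mig-1g-5gb
spec:
  selectors:
  - cel:
      # Illustrative expression; real device attributes are published by the GPU driver in ResourceSlices
      expression: device.driver == "gpu.nvidia.com"
---
apiVersion: resource.k8s.io/v1
kind: ResourceClaimTemplate
metadata:
  name: mig-claim-template
spec:
  spec:
    devices:
      requests:
      - name: gpu
        exactly:
          deviceClassName: mig-1g-5gb    # one device from the class above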


Why It Matters

Before DRA, Kubernetes treated specialized hardware as “extended resources” that were coarse-grained and inflexible. A Pod requesting a GPU often received the entire device even if it only needed a fraction of its capacity. This resulted in waste and made sharing accelerators across workloads difficult.

DRA changes that model by enabling:

  • Fractional allocation of devices when supported by vendors (for example, GPU partitions).

  • Flexible sharing policies so clusters can maximize utilization.

  • Built-in health checks and monitoring for hardware resources.

  • Fine-grained control at scale without custom device plugins for every use case.


Real-World Impact

Enterprises running AI/ML workloads, data analytics pipelines, or specialized networking functions benefit directly. Instead of statically tying a GPU to a single job, multiple Pods can share resources according to defined policies. This means higher ROI on costly hardware, fewer idle accelerators, and better scheduling fairness across teams.

For developers, the process is simpler: they request what they need, and Kubernetes ensures the right hardware is allocated. For operators, DRA reduces the need for ad hoc device management scripts and ensures consistency across environments.


Real-World Example

Imagine a financial services company running multiple machine learning workloads for fraud detection. Some jobs require full GPUs for training, while others only need a fraction of a GPU for inference. With DRA, administrators define a DeviceClass for NVIDIA MIG slices. Training jobs can request full GPUs while inference Pods request smaller MIG partitions. Kubernetes schedules each workload appropriately, ensuring that GPUs are fully utilized. This prevents the cluster from leaving half of a GPU idle while waiting for a job that can consume the whole device, translating to better hardware efficiency and lower costs.


Key Technical Notes from the KEP

According to KEP-4381:

  • DRA is enabled by default in Kubernetes v1.34 and requires no feature gates.

  • APIs have graduated to resource.k8s.io/v1, providing long-term stability.

  • ResourceClaims define specific allocations and are bound to Pods for lifecycle management.

  • ResourceSlices provide scalable descriptions of available hardware, making large clusters more efficient.

  • The design explicitly supports vendor extensions so hardware providers can define custom policies without breaking core Kubernetes APIs.


ServiceAccount Tokens for Image Pulls (Beta): Secure Image Pulls Made Simple

What It Is


Kubernetes 1.34 introduces a more secure and streamlined way for Pods to authenticate to private container registries. Instead of relying on manually created, long-lived image pull secrets, the kubelet now uses short-lived ServiceAccount tokens that are automatically rotated and scoped to the Pod. This feature reaches Beta in v1.34 and is enabled by default, aligning image pulls with Kubernetes’ broader shift toward workload identity and zero-trust security.


High-Level Workflow

Traditionally, administrators created Docker config secrets and attached them to Pods or Deployments as imagePullSecrets. In v1.34, the kubelet instead obtains a short-lived, audience-scoped token for the Pod’s ServiceAccount and hands it to its image credential provider, which either exchanges it for registry credentials or presents it to a registry that validates the token directly (often via OIDC federation). No manual secret creation is needed, and the credentials expire on their own rather than lingering after the Pod is gone.


For example, a Pod pulling from a private registry might look like this:

---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: ci-cd-runner
  namespace: prod-team
---
apiVersion: v1
kind: Pod
metadata:
  name: private-image-test
  namespace: prod-team
spec:
  serviceAccountName: ci-cd-runner
  containers:
  - name: app
    image: registry.acme.com/our-app:latest

Notice that there is no reference to an imagePullSecret. The kubelet uses the Pod’s ServiceAccount token to authenticate directly.
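
On the node side, this behavior depends on a kubelet image credential provider that is configured to request ServiceAccount tokens. A rough sketch of such a configuration is shown below; the provider name, registry host, and audience are illustrative, and the tokenAttributes fields should be verified against the kubelet credential provider documentation for v1.34.

---
apiVersion: kubelet.config.k8s.io/v1
kind: CredentialProviderConfig
providers:
- name: acme-registry-provider                      # hypothetical provider plugin installed on the node
  apiVersion: credentialprovider.kubelet.k8s.io/v1
  matchImages:
  - "registry.acme.com"
  defaultCacheDuration: "5m"
  tokenAttributes:
    serviceAccountTokenAudience: registry.acme.com   # audience the ServiceAccount token is minted for
    requireServiceAccount: true                      # refuse to pull if the Pod has no ServiceAccount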


Why It Matters

In earlier versions of Kubernetes, image pull secrets often became an operational burden:

  • Security risk: Long-lived, shared secrets could leak or expire, creating vulnerabilities.

  • Operational overhead: Admins had to manually distribute and rotate secrets across namespaces or clusters.

  • Scaling issues: More teams and more registries meant more secrets to track and maintain.

By moving to ephemeral, automatically rotated tokens, Kubernetes significantly reduces attack surface and operational friction. Each token is tied to both the ServiceAccount and the Pod, which prevents misuse outside of its intended context.


Real-World Impact

For platform teams, this change eliminates secret sprawl and reduces human error in registry authentication. Enterprises that operate at scale — especially with CI/CD pipelines and multi-tenant clusters — can now rely on Kubernetes to handle registry authentication securely without building custom automation.

Developers benefit from a smoother workflow since they no longer need to know about or reference imagePullSecrets. Their Pods simply run with the correct ServiceAccount, and Kubernetes handles the rest.


Real-World Example

Consider a GitLab CI/CD pipeline that builds application images and pushes them to a private registry such as Harbor or Artifactory. In previous Kubernetes versions, developers had to create and manage imagePullSecrets in every namespace where these images were consumed. Now, with ServiceAccount tokens, the pipeline can deploy workloads into the cluster without managing static credentials. The kubelet uses the ServiceAccount’s short-lived token to authenticate each Pod directly to the registry. This reduces setup time, eliminates secret management scripts, and ensures credentials cannot be reused or leaked outside of their intended scope.


Key Technical Notes

According to the v1.34 release documentation:

  • Beta and enabled by default: The feature gate is on out of the box in v1.34, but ServiceAccount tokens are only used for registries whose kubelet credential provider is configured to request them.

  • Feature gate: Controlled by KubeletServiceAccountTokenForCredentialProviders, building on the kubelet’s pluggable credential provider mechanism and ServiceAccount token projection.

  • OIDC alignment: Tokens follow OIDC standards, making it easier for registries to validate credentials securely.

  • Scoped lifecycle: Tokens automatically expire with the Pod, eliminating leftover credentials.



Deployment Pod Replacement Policy (Alpha): Faster vs. Safer Rollouts, You Choose


What It Is

Kubernetes 1.34 introduces a new Pod Replacement Policy for Deployments as an alpha feature. This enhancement gives operators explicit control over how Pods are replaced during rolling updates and scale-down events. Until now, the handling of terminating Pods was fixed and implicit, with no way to tune it per Deployment. The new policy lets you decide whether to prioritize faster rollouts or stricter resource control.


High-Level Workflow

By setting the .spec.podReplacementPolicy field in a Deployment, operators can select one of two behaviors:

  • TerminationStarted: New Pods begin launching as soon as old Pods start terminating. Rollouts complete faster but may temporarily increase resource usage.

  • TerminationComplete: New Pods only launch after old Pods have fully terminated. Rollouts are slower but resource usage never exceeds the desired replica count.

This flexibility allows teams to align update strategies with the specific needs of their workloads and cluster capacity.


Why It Matters

Rolling updates are a cornerstone of Kubernetes application management, but the default “one-size-fits-all” approach created challenges:

  • For high-traffic applications, slow rollouts delay new features and fixes.

  • For resource-heavy workloads, launching new Pods too quickly could overwhelm cluster capacity.

The Pod Replacement Policy removes these trade-offs by letting operators choose the behavior that best fits their environment. It makes rollouts predictable, tunable, and aligned with business priorities instead of forcing a compromise.


Real-World Impact

This feature directly benefits enterprise teams managing production applications at scale. For example:

  • A SaaS company can use TerminationStarted for stateless web services, ensuring faster feature delivery without noticeable downtime.

  • A data analytics platform can stick with TerminationComplete for workloads that run large memory- or GPU-intensive Pods, avoiding resource contention during updates.

By allowing teams to optimize on a per-Deployment basis, Kubernetes makes update strategies a deliberate decision rather than an implicit default.


Real-World Example

Imagine an e-commerce company rolling out a new backend service before a major holiday sale. Each Pod requires 60 seconds to shut down due to cleanup tasks. With TerminationComplete, every batch of the 10-replica rollout waits out that full minute before replacements start, stretching the update considerably. By setting TerminationStarted, Kubernetes begins launching replacement Pods as soon as termination begins, cutting rollout time sharply. This gets the new version live quickly, while the cluster absorbs a short-lived resource surge from old and new Pods overlapping.

---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web-backend
spec:
  replicas: 10
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxSurge: 3
      maxUnavailable: 2
  podReplacementPolicy: TerminationStarted
  template:
    metadata:
      labels:
        app: web-backend
    spec:
      containers:
      - name: app
        image: acme/web-backend:v2



Key Technical Notes

  • Alpha status: Disabled by default in Kubernetes 1.34.

  • Feature gates: Requires DeploymentPodReplacementPolicy and DeploymentReplicaSetTerminatingReplicas enabled on both the API server and controller manager (see the sketch after this list).

  • Default behavior: Deployments that do not set the field keep their existing replacement behavior; the new semantics apply only when the policy is set explicitly.

  • Use with caution: As with all alpha features, production use should be carefully evaluated.
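
If you bootstrap clusters with kubeadm, the gates can be turned on for both control plane components through the ClusterConfiguration. This is a sketch assuming the kubeadm.k8s.io/v1beta4 configuration format; adjust it to however your control plane flags are actually managed.

---
apiVersion: kubeadm.k8s.io/v1beta4
kind: ClusterConfiguration
apiServer:
  extraArgs:
  - name: feature-gates
    value: "DeploymentPodReplacementPolicy=true,DeploymentReplicaSetTerminatingReplicas=true"
controllerManager:
  extraArgs:
  - name: feature-gates
    value: "DeploymentPodReplacementPolicy=true,DeploymentReplicaSetTerminatingReplicas=true"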


Conclusion: Stability with Progress

Kubernetes 1.34 proves that innovation does not have to come at the cost of stability.

The release continues the project’s steady evolution, delivering meaningful new capabilities without introducing breaking changes. The graduation of Dynamic Resource Allocation makes specialized hardware easier to use and more efficient. The introduction of ServiceAccount tokens for image pulls strengthens security while simplifying registry authentication. And the new Pod Replacement Policy for Deployments gives operators long-awaited flexibility to control rollout strategies. Together, these updates strike the balance between stability and innovation that enterprises depend on.


Beyond these headline features, the release also includes several quality-of-life improvements:

  • Kubelet and API server tracing have graduated to Stable, enabling end-to-end observability with OpenTelemetry.

  • A preview of KYAML introduces a stricter YAML format that reduces common configuration errors.

  • The Horizontal Pod Autoscaler now supports a configurable tolerance for scale-up and scale-down decisions (see the sketch after this list).

  • Usability updates include more flexible DNS label validation and support for fully qualified domain names as Pod hostnames.
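
As one example of these quality-of-life changes, the HPA tolerance can now be tuned per direction. The sketch below assumes the autoscaling/v2 behavior fields added by the configurable-tolerance enhancement, with the relevant feature gate enabled; confirm the field names against the v1.34 autoscaling documentation.

---
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: web-backend
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: web-backend
  minReplicas: 3
  maxReplicas: 30
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 70
  behavior:
    scaleUp:
      tolerance: 0.05    # react to smaller metric deviations when scaling up
    scaleDown:
      tolerance: 0.15    # require a larger deviation before scaling down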


Taken together, Kubernetes 1.34 enhances security, efficiency, and operational control while leaving the upgrade path smooth. For organizations running production clusters, it is a release worth adopting with confidence.
