Kubernetes has become the go-to platform for running not just long-lived services, but also batch workloads like data processing, ETL pipelines, machine learning training, CI/CD pipelines, and scientific simulations. These workloads typically rely on the Job API, which ensures that a specified number of Pods run to completion.
Until now, Kubernetes offered limited control over what happens when a Job’s Pod fails or is evicted. Pod replacement behavior was often unpredictable: would the replacement Pod be created immediately, while the old Pod was still terminating, or only after the old Pod was completely gone?
With Kubernetes v1.34, the Pod Replacement Policy for Jobs feature, driven by KEP-3939, lets users explicitly control when replacement Pods are created, improving the reliability, performance, and efficiency of batch workloads.
When a Pod belonging to a Job fails (for example, due to a node drain, eviction, an OOM kill, or a hardware issue), Kubernetes creates a replacement Pod. However, users have had no say in when that replacement is created. For workloads that depend on node affinity or cached state, or that must never run two Pods for the same task, this can be a real problem.
Current behavior:
By default, the Job controller replaces Pods as soon as they start terminating, which can lead to multiple Pods running for the same task at the same time, especially in Indexed Jobs. This causes issues for workloads that require exactly one Pod per task, such as certain machine learning frameworks.
Starting replacement Pods before the old Pods have fully terminated also causes other problems, such as extra cluster resources being consumed while old and new Pods briefly run side by side.
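To make the exactly-one-Pod-per-task case concrete, here is a minimal sketch of an Indexed Job; the name, image, and shard count are illustrative and not taken from the demo below:

```yaml
# indexed-training-job.yaml (illustrative example)
apiVersion: batch/v1
kind: Job
metadata:
  name: indexed-training-job
spec:
  completionMode: Indexed   # each Pod gets a unique completion index (0..3)
  completions: 4
  parallelism: 4
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: trainer
          image: busybox
          # JOB_COMPLETION_INDEX is injected automatically for Indexed Jobs
          command: ["sh", "-c", "echo training shard $JOB_COMPLETION_INDEX; sleep 60"]
```

With the default replacement behavior, a deleted Pod for, say, index 2 can briefly coexist with its replacement for the same index; the Failed policy described next avoids this overlap.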
Feature: Pod Replacement Policy
With this feature, Kubernetes Jobs have two Pod replacement policies to choose from:
TerminatingOrFailed (default): creates a replacement Pod as soon as the old one starts terminating or fails.
Failed: waits until the old Pod fully terminates and reaches the Failed phase before creating a new one.
Using podReplacementPolicy: Failed ensures that at most one Pod runs for a given task at a time.
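The policy is set on the Job spec as podReplacementPolicy, and Jobs also report how many of their Pods are currently terminating in their status. A quick way to inspect both, assuming a placeholder Job named my-job:

```bash
# which replacement policy the Job is using (replace my-job with your Job's name)
kubectl get job my-job -o jsonpath='{.spec.podReplacementPolicy}'

# how many of the Job's Pods are currently terminating
kubectl get job my-job -o jsonpath='{.status.terminating}'
```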
:::info Quick Demo
We will demo the Pod Replacement Policy for Jobs feature under both policies: the default TerminatingOrFailed and Failed.
:::
Scenario 1: Default behavior with TerminatingOrFailed (demo steps)
```bash
# install minikube and start a local cluster
brew install minikube
minikube start --kubernetes-version=v1.34.0

# verify the cluster is running and reports Kubernetes v1.34.0
kubectl get nodes
```
```yaml
# worker-job.yaml
apiVersion: batch/v1
kind: Job
metadata:
  name: worker-job
spec:
  completions: 2
  parallelism: 1
  podReplacementPolicy: TerminatingOrFailed
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: worker
          image: busybox
          command: ["sh", "-c", "echo Running; sleep 30"]
```
```bash
kubectl apply -f worker-job.yaml

# monitor the Job's Pods
kubectl get pods -l job-name=worker-job
```
```bash
# delete the Pods associated with the worker-job Job
kubectl delete pod -l job-name=worker-job
```

Behavior: because the policy is TerminatingOrFailed, the Job controller creates replacement Pods as soon as the old Pods start terminating, so for a short time the old and new Pods overlap.
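To watch the overlap as it happens, you can run a watch in a second terminal before deleting the Pods; this sketch assumes the worker-job created above:

```bash
# in a second terminal: watch the Job's Pods being replaced.
# With TerminatingOrFailed, the old Pod shows up as Terminating
# alongside its freshly created replacement for a short time.
kubectl get pods -l job-name=worker-job -w
```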
Scenario 2: Delayed replacement with the Failed policy (demo steps)
```yaml
# worker-job-failed.yaml
apiVersion: batch/v1
kind: Job
metadata:
  name: worker-job-failed
spec:
  completions: 2
  parallelism: 1
  podReplacementPolicy: Failed
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: worker
          image: busybox
          command: ["sh", "-c", "echo Running; sleep 1000"]
```
```bash
kubectl apply -f worker-job-failed.yaml

# monitor the Job's Pods
kubectl get pods -l job-name=worker-job-failed
```
```bash
# delete the Pods associated with the worker-job-failed Job
kubectl delete pod -l job-name=worker-job-failed
```
Behavior: the replacement Pod worker-job-failed-q98qx is created only after the old Pod worker-job-failed-sg42q has fully terminated; there is no overlap between the old and new Pods.
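To double-check this from the Job's status (a sketch, assuming the worker-job-failed Job above), look at the terminating count right after deleting the Pod; with the Failed policy it should report the old Pod as terminating while no replacement exists yet. The last two commands clean up the demo resources:

```bash
# the old Pod is counted as terminating, and no replacement has been created yet
kubectl get job worker-job-failed -o jsonpath='{.status.terminating}'
kubectl get pods -l job-name=worker-job-failed

# clean up the demo Jobs and the local cluster
kubectl delete job worker-job worker-job-failed
minikube delete
```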
Better User Experience: For developers, running jobs becomes less error-prone. Teams can focus on business logic instead of constantly monitoring for pod failures.
Pod Replacement Policy gives you control over Pod creation timing to avoid overlaps, optimizes cluster resources by preventing temporary extra Pods, and offers the flexibility to choose the right policy for your Job workloads based on your requirements and resource constraints.