DiscoverKubeFMThe Data Engineer's guide to optimizing Kubernetes, with Niels Claeys
The Data Engineer's guide to optimizing Kubernetes, with Niels Claeys

The Data Engineer's guide to optimizing Kubernetes, with Niels Claeys

Update: 2025-10-14
Share

Description

Niels Claeys shares how his team at Dataminded built Conveyor, a data platform processing up to 1.5 million core hours monthly. He explains the specific optimizations they discovered through production experience, from scheduler changes that immediately reduce costs by 10-15% to achieving 97% spot instance usage without reliability issues.

You will learn:

  • Why the default Kubernetes scheduler wastes money on batch workloads and how switching from "least allocated" to "most allocated" scheduling enables faster scale-down and better resource utilization

  • How to achieve 97% spot instance adoption through strategic instance type diversification, region selection, and Spark-specific techniques

  • Node pool design principles that balance Kubernetes overhead with workload efficiency

  • Platform-specific gotchas like AWS cross-AZ data transfer costs that can spike bills unexpectedly

Sponsor

This episode is brought to you by Testkube—where teams run millions of performance tests in real Kubernetes infrastructure. From air-gapped environments to massive scale deployments, orchestrate every testing tool in one platform. Check it out at testkube.io

More info

Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

The Data Engineer's guide to optimizing Kubernetes, with Niels Claeys

The Data Engineer's guide to optimizing Kubernetes, with Niels Claeys