ahmet alp balkan
banner
ahmet.dev
ahmet alp balkan
@ahmet.dev
working on compute infra at LinkedIn.
ex-twitter/googlecloud/azure
dist sys enthusiast
github.com/ahmetb
#kubecon: roughly 14% flights into atlanta cancelled today, even more predicted for sunday/monday. www.flightaware.com/live/cancell...
November 9, 2025 at 4:50 AM
#kubecon talk finally coming together
see you all there Thursday 11:45am or in the hallway
kccncna2025.sched.com/event/27Fdd/...
November 9, 2025 at 4:45 AM
Reposted by ahmet alp balkan
I can't wait for the MOST cool, relaxed part of geeking out at @cncf.bsky.social #kubecon, which is actually hitting @rejekts.io!!! cloud-native.rejekts.io

Join us if you're around #atlanta or at KubeCon!
Cloud Native Rejekts
cloud-native.rejekts.io
November 3, 2025 at 4:45 PM
Reposted by ahmet alp balkan
Most people think the Kubernetes scheduler makes scheduling decisions itself. Well, it doesn’t.

The real power lies elsewhere.

🧵 Let’s talk about how the kube-scheduler is actually a plugin-first orchestration engine, not a decision-maker.
October 6, 2025 at 7:21 PM
KubeCon goers: what are the Atlanta hotels you're booking that's not next to the boring downtown/convention center area?
September 26, 2025 at 4:10 AM
Anyone have recommended readings on any scalable k8s multi-cluster work API implementations? Planning to roll out a similar model on our own and I want to study the state of the art and the rationale behind the existing model.
August 5, 2025 at 5:55 PM
#KubeCon schedule is out! Team LinkedIn will give 3 talks this year (myself included):
1️⃣ Making Application Rollouts Observable (lnkd.in/gmnM-8Z4)
2️⃣ How We Upgrade the Whole ML Training Fleet Bi-weekly(lnkd.in/gG_a_Etc)
3️⃣ Evicted! All the Ways Kubernetes Kills Your Pods
(lnkd.in/ggm9HKZF)
August 1, 2025 at 10:24 PM
Our team at LinkedIn will give 3 talks at the #KubeCon 2025! Looks like we'll be meeting in Atlanta again. 👋🏼 Stay tuned.
July 30, 2025 at 3:36 AM
Does anyone know why all cloud-native open source ecosystem uses pull model to expose their /metrics (instead of pushing the metrics)? Pull model is really poor when the metric cardinality exposed on a single endpoint increases.
July 29, 2025 at 8:41 PM
Reposted by ahmet alp balkan
We’ve been systematically evaluating the scalability of our fleet’s control planes, and the recent work on storage layer scalability in the Kubernetes community is going to prove pivotal as we scale further!

Check out what we’ve been upto for the past couple of weeks ♥️
As we test Kubernetes clusters beyond the supported 5,000+ nodes limit at LinkedIn, we're finding out 'List API' calls are a bottleneck in cluster reliability. So I'm publishing a deep dive about why this is a problem and what you can do about them. ahmet.im/blog/kuberne...
July 9, 2025 at 10:13 PM
As we test Kubernetes clusters beyond the supported 5,000+ nodes limit at LinkedIn, we're finding out 'List API' calls are a bottleneck in cluster reliability. So I'm publishing a deep dive about why this is a problem and what you can do about them. ahmet.im/blog/kuberne...
July 9, 2025 at 8:38 PM
Open sourcing a new kubectl plugin called `kubectl node-resource` that queries allocations/utilizations for all/subset of the Kubernetes nodes. It has optimizations for large clusters. We've needed this in our set up to quickly answer questions. github.com/ahmetb/kubec...
June 23, 2025 at 9:44 PM
Glad to have @madhavjivrajani.bsky.social join us at LinkedIn compute team this summer. I’m constantly impressed by the depth of his kube-apiserver expertise.
June 23, 2025 at 4:36 PM
DAYS SINCE ETCD BACKUPS SAVED OUR ASS IN PRODUCTION: 0

If you manage your Kubernetes clusters and think you won’t lose your etcd, you’re almost certainly wrong. It takes years for the disaster to happen and you’ll be glad you have invested in building a backup/restore system.
May 29, 2025 at 8:21 PM
@kelseyhightower.com hi Kelsey, what was your toto model again? can’t find after the twitter nuke ☹️
May 11, 2025 at 5:42 PM
Reposted by ahmet alp balkan
@ahmet.dev and I shared our approach at #KubeConEU25 couple weeks ago with ~500 attendees (though the credit belongs to the entire team behind the platform). Great to see it resonate!

Watch here: youtu.be/dDkXFuy45EA?...
From Metal To Apps: LinkedIn’s Kubernetes-based Compute Platform - Ahmet Alp Balkan & Ronak Nathani
YouTube video by CNCF [Cloud Native Computing Foundation]
youtu.be
April 16, 2025 at 3:10 AM
Finally delivered our talk at #KubeCon to 500 attendees!
Slides are up at kccnceu2025.sched.com/event/1txGQ/ and recording will be coming soon. If you wanna chat, I'm at the conference today.
April 4, 2025 at 9:36 AM
Here we go again, this time in London.
#kubecon maintainer summit
March 31, 2025 at 8:16 AM
🇬🇧 Looking forward to seeing y'all at our #KubeCon talk about how we're building a bare-metal compute platform at LinkedIn to run thousands of services on high-6-digit number of machines!

Schedule: sched.co/1txGQ
March 27, 2025 at 11:36 PM
Reposted by ahmet alp balkan
Our discussion for the @kubernetespodcast.com with @ahmet.dev and @ronaknathani.bsky.social of is finally live 🥳

I had a great time talking to these awesome people about how to leverage #k8s inside LinkedIn and what they learned along the way 🙌

Take a listen 👇

#KubernetesPodcast #k8s
March 27, 2025 at 3:17 PM
Reposted by ahmet alp balkan
This episode is a real treat- @ahmet.dev and @ronaknathani.bsky.social of @linkedin.com share lessons from running Kubernetes in production! Learn about the custom tools they've built for running Stateful workloads, how many clusters is the right number, & more!
kubernetespodcast.com/episode/249-...
Kubernetes Podcast from Google: Episode 249 - Kubernetes at LinkedIn, with Ahmet Alp Balkan and Ronak Nathani
Ahmet Alp Balkan and Ronak Nathani are software engineers at LinkedIn compute infrastructure team running the Kubernetes platform for LinkedIn and they joined us today to talk about how they run Kuber...
kubernetespodcast.com
March 25, 2025 at 4:48 PM
Looking forward to seeing y'all at #KubeCon London next week! (Who's going?)
March 27, 2025 at 12:56 AM
New article on my blog that explains all the various ways Kubernetes can evict your workloads. This wasn't all documented in one place so I'm hoping this helps the teams operating with Kubernetes in production. You'll also find a cheatsheet in the article. ahmet.im/blog/kuberne...
February 27, 2025 at 8:32 PM
Reposted by ahmet alp balkan
Selam Türkiye 🇹🇷

Our Google /Dev/Cloud event is coming to Istanbul on March 13 🥳. We will have a full day of technical content and hands-on workshops about Google Cloud, GenAI, Cloud Runtimes, and more. Organized by the Cloud DevRel Team for developers 👨‍🏫 👩‍🏫 ⌨️ .
Google /dev/cloud day Istanbul
Looking to boost your AI and cloud skills? Join us for Google /dev/cloud day Istanbul! Learn, connect, and get hands-on experience. Register today.
cloud.google.com
February 15, 2025 at 6:49 PM
I might _attempt_ to write a blog about all the different things that causes Kubernetes evicts pods. There are so many, and every single one works differently.
February 8, 2025 at 9:58 PM